Alexandre Lebas - Océane Deletrez
This case study is on using decision trees to predict customer response and optimize profit. To improve customer contact process and maximize the amount of profit, decision trees were built with R to model customer contact history and predict the response of customers. And then the customers can be prioritized to contact based on the prediction, so that profit can be maximized, given a limited amount of time, cost and human resources. (File Cup98LRN, whippet, directory Project – B - 2019) Keywords: Decision tree, prediction, profit optimization Main steps:
# Alexandre Lebas - Océane Deletrez
# Inspiré du livre de Yanchang Zhao
data.train = read.csv(file = "cup98LRN.txt")
dim(data.train)
Warning message in scan(file = file, what = what, sep = sep, quote = quote, dec = dec, : "caractère(s) 'nul' au milieu de l'entrée"
data.val = read.csv(file = "cup98VAL.txt")
dim(data.val)
Warning message in scan(file = file, what = what, sep = sep, quote = quote, dec = dec, : "caractère(s) 'nul' au milieu de l'entrée"
On remarque que dans les 2 jeux de données il y a plus de 400 variables. Entre le jeu dé d'entrainement et le jeu de validation il y a deux variables en plus ce qui signifie qu'il y a deux valeurs à prédire. Ce qui semble logique serait de prévoir si une personne a donné ou non et combien elle a donné.
# Nous permet d'avoir une description de toutes les variables
install.packages("Hmisc")
Installation du package dans 'C:/Users/alex5/OneDrive/Documents/R/win-library/4.1' (car 'lib' n'est pas spécifié)
Une version binaire est disponible mais la version du source est plus
récente:
binary source needs_compilation
Hmisc 4.5-0 4.6-0 TRUE
Binaries will be installed
package 'Hmisc' successfully unpacked and MD5 sums checked
The downloaded binary packages are in
C:\Users\alex5\AppData\Local\Temp\RtmpgnoXDC\downloaded_packages
library(Hmisc)
describe(data.train)
Le chargement a nécessité le package : lattice
Le chargement a nécessité le package : survival
Le chargement a nécessité le package : Formula
Le chargement a nécessité le package : ggplot2
Attachement du package : 'Hmisc'
Les objets suivants sont masqués depuis 'package:base':
format.pval, units
data.train
481 Variables 10000 Observations
--------------------------------------------------------------------------------
ODATEDW
n missing distinct Info Mean Gmd .05 .10
10000 0 16 0.986 9141 390.7 8601 8601
.25 .50 .75 .90 .95
8801 9201 9501 9601 9601
lowest : 8601 8701 8801 8804 8810, highest: 9401 9501 9509 9601 9701
Value 8601 8701 8801 8804 8810 8901 9001 9101 9201 9301 9302
Frequency 1544 369 687 1 1 534 803 925 787 391 1
Proportion 0.154 0.037 0.069 0.000 0.000 0.053 0.080 0.092 0.079 0.039 0.000
Value 9401 9501 9509 9601 9701
Frequency 1325 1595 2 1034 1
Proportion 0.132 0.160 0.000 0.103 0.000
--------------------------------------------------------------------------------
OSOURCE
n missing distinct
9902 98 516
lowest : ABC ABD ABW ACB ACC, highest: WWF YAL YKA YKD YNF
--------------------------------------------------------------------------------
TCODE
n missing distinct Info Mean Gmd .05 .10
10000 0 24 0.895 51.57 100.8 0 0
.25 .50 .75 .90 .95
0 1 2 28 28
lowest : 0 1 2 3 4, highest: 1002 4002 13002 28028 72002
Value 0 200 1000 4000 13000 28000 72000
Frequency 9794 2 193 2 1 7 1
Proportion 0.979 0.000 0.019 0.000 0.000 0.001 0.000
For the frequency table, variable is rounded to the nearest 200
--------------------------------------------------------------------------------
STATE
n missing distinct
10000 0 53
lowest : AA AE AK AL AP, highest: VT WA WI WV WY
--------------------------------------------------------------------------------
ZIP
n missing distinct
10000 0 6323
lowest : 01876 02038 02124 04364 05033, highest: 99803 99821 99824 99833 99901
--------------------------------------------------------------------------------
MAILCODE
n missing distinct value
151 9849 1 B
Value B
Frequency 151
Proportion 1
--------------------------------------------------------------------------------
PVASTATE
n missing distinct
160 9840 2
Value E P
Frequency 1 159
Proportion 0.006 0.994
--------------------------------------------------------------------------------
DOB
n missing distinct Info Mean Gmd .05 .10
10000 0 779 0.985 2722 2430 0 0
.25 .50 .75 .90 .95
301 2607 4601 5601 6111
lowest : 0 1 2 101 103, highest: 7801 7906 8004 8206 8408
--------------------------------------------------------------------------------
NOEXCH
n missing distinct
10000 0 3
Value 0 1 X
Frequency 9961 34 5
Proportion 0.996 0.003 0.000
--------------------------------------------------------------------------------
RECINHSE
n missing distinct value
697 9303 1 X
Value X
Frequency 697
Proportion 1
--------------------------------------------------------------------------------
RECP3
n missing distinct value
193 9807 1 X
Value X
Frequency 193
Proportion 1
--------------------------------------------------------------------------------
RECPGVG
n missing distinct value
11 9989 1 X
Value X
Frequency 11
Proportion 1
--------------------------------------------------------------------------------
RECSWEEP
n missing distinct value
178 9822 1 X
Value X
Frequency 178
Proportion 1
--------------------------------------------------------------------------------
MDMAUD
n missing distinct
10000 0 11
lowest : C1CM C1MM C2CM D1CM D2CM, highest: I1LM I2MM L1CM L2CM XXXX
Value C1CM C1MM C2CM D1CM D2CM I1CM I1LM I2MM L1CM L2CM XXXX
Frequency 2 1 4 2 2 3 1 1 3 1 9980
Proportion 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.998
--------------------------------------------------------------------------------
DOMAIN
n missing distinct
9781 219 16
lowest : C1 C2 C3 R1 R2, highest: T3 U1 U2 U3 U4
Value C1 C2 C3 R1 R2 R3 S1 S2 S3 T1 T2
Frequency 630 868 566 129 1450 515 1200 898 221 523 1268
Proportion 0.064 0.089 0.058 0.013 0.148 0.053 0.123 0.092 0.023 0.053 0.130
Value T3 U1 U2 U3 U4
Frequency 203 467 337 274 232
Proportion 0.021 0.048 0.034 0.028 0.024
--------------------------------------------------------------------------------
CLUSTER
n missing distinct Info Mean Gmd .05 .10
9781 219 53 0.999 27.85 16.65 4 8
.25 .50 .75 .90 .95
15 28 40 47 50
lowest : 1 2 3 4 5, highest: 49 50 51 52 53
--------------------------------------------------------------------------------
AGE
n missing distinct Info Mean Gmd .05 .10
7544 2456 83 1 61.74 19.14 35 39
.25 .50 .75 .90 .95
48 62 75 83 87
lowest : 13 16 18 19 20, highest: 94 95 96 97 98
--------------------------------------------------------------------------------
AGEFLAG
n missing distinct
6942 3058 2
Value E I
Frequency 6095 847
Proportion 0.878 0.122
--------------------------------------------------------------------------------
HOMEOWNR
n missing distinct
7721 2279 2
Value H U
Frequency 5504 2217
Proportion 0.713 0.287
--------------------------------------------------------------------------------
CHILD03
n missing distinct
124 9876 3
Value B F M
Frequency 5 25 94
Proportion 0.040 0.202 0.758
--------------------------------------------------------------------------------
CHILD07
n missing distinct
149 9851 3
Value B F M
Frequency 8 38 103
Proportion 0.054 0.255 0.691
--------------------------------------------------------------------------------
CHILD12
n missing distinct
197 9803 3
Value B F M
Frequency 18 59 120
Proportion 0.091 0.299 0.609
--------------------------------------------------------------------------------
CHILD18
n missing distinct
285 9715 3
Value B F M
Frequency 20 119 146
Proportion 0.070 0.418 0.512
--------------------------------------------------------------------------------
NUMCHLD
n missing distinct Info Mean Gmd
1322 8678 5 0.733 1.53 0.7571
lowest : 1 2 3 4 5, highest: 1 2 3 4 5
Value 1 2 3 4 5
Frequency 834 329 113 38 8
Proportion 0.631 0.249 0.085 0.029 0.006
--------------------------------------------------------------------------------
INCOME
n missing distinct Info Mean Gmd
7819 2181 7 0.974 3.857 2.111
lowest : 1 2 3 4 5, highest: 3 4 5 6 7
Value 1 2 3 4 5 6 7
Frequency 960 1441 893 1327 1632 793 773
Proportion 0.123 0.184 0.114 0.170 0.209 0.101 0.099
--------------------------------------------------------------------------------
GENDER
n missing distinct
9725 275 5
lowest : C F J M U, highest: C F J M U
Value C F J M U
Frequency 1 5339 47 4164 174
Proportion 0.000 0.549 0.005 0.428 0.018
--------------------------------------------------------------------------------
WEALTH1
n missing distinct Info Mean Gmd .05 .10
5366 4634 10 0.987 5.372 3.127 1 1
.25 .50 .75 .90 .95
3 6 8 9 9
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 260 338 421 481 529 523 610 641 742 821
Proportion 0.048 0.063 0.078 0.090 0.099 0.097 0.114 0.119 0.138 0.153
--------------------------------------------------------------------------------
HIT
n missing distinct Info Mean Gmd .05 .10
10000 0 58 0.802 3.186 5.169 0 0
.25 .50 .75 .90 .95
0 0 3 10 16
lowest : 0 1 2 3 4, highest: 53 59 60 65 240
--------------------------------------------------------------------------------
MBCRAFT
n missing distinct Info Mean Gmd
4490 5510 6 0.3 0.1465 0.2665
lowest : 0 1 2 3 4, highest: 1 2 3 4 6
Value 0 1 2 3 4 6
Frequency 3986 386 88 26 3 1
Proportion 0.888 0.086 0.020 0.006 0.001 0.000
--------------------------------------------------------------------------------
MBGARDEN
n missing distinct Info Mean Gmd
4490 5510 4 0.139 0.05523 0.1057
Value 0 1 2 3
Frequency 4271 194 21 4
Proportion 0.951 0.043 0.005 0.001
--------------------------------------------------------------------------------
MBBOOKS
n missing distinct Info Mean Gmd .05 .10
4490 5510 10 0.847 1.066 1.488 0 0
.25 .50 .75 .90 .95
0 0 1 3 5
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 2314 1090 481 235 142 90 49 32 18 39
Proportion 0.515 0.243 0.107 0.052 0.032 0.020 0.011 0.007 0.004 0.009
--------------------------------------------------------------------------------
MBCOLECT
n missing distinct Info Mean Gmd
4484 5516 4 0.135 0.05531 0.1061
Value 0 1 2 3
Frequency 4273 181 23 7
Proportion 0.953 0.040 0.005 0.002
--------------------------------------------------------------------------------
MAGFAML
n missing distinct Info Mean Gmd
4490 5510 9 0.645 0.4327 0.669
lowest : 0 1 2 3 4, highest: 4 5 6 7 9
Value 0 1 2 3 4 5 6 7 9
Frequency 3151 927 279 94 29 4 4 1 1
Proportion 0.702 0.206 0.062 0.021 0.006 0.001 0.001 0.000 0.000
--------------------------------------------------------------------------------
MAGFEM
n missing distinct Info Mean Gmd
4490 5510 5 0.283 0.1209 0.2195
lowest : 0 1 2 3 4, highest: 0 1 2 3 4
Value 0 1 2 3 4
Frequency 4018 413 48 10 1
Proportion 0.895 0.092 0.011 0.002 0.000
--------------------------------------------------------------------------------
MAGMALE
n missing distinct Info Mean Gmd
4490 5510 4 0.175 0.06837 0.1289
Value 0 1 2 3
Frequency 4210 256 21 3
Proportion 0.938 0.057 0.005 0.001
--------------------------------------------------------------------------------
PUBGARDN
n missing distinct Info Mean Gmd
4490 5510 6 0.271 0.1425 0.2631
lowest : 0 1 2 3 4, highest: 1 2 3 4 5
Value 0 1 2 3 4 5
Frequency 4041 313 97 26 10 3
Proportion 0.900 0.070 0.022 0.006 0.002 0.001
--------------------------------------------------------------------------------
PUBCULIN
n missing distinct Info Mean Gmd
4490 5510 5 0.3 0.1343 0.2427
lowest : 0 1 2 3 4, highest: 0 1 2 3 4
Value 0 1 2 3 4
Frequency 3985 424 67 11 3
Proportion 0.888 0.094 0.015 0.002 0.001
--------------------------------------------------------------------------------
PUBHLTH
n missing distinct Info Mean Gmd .05 .10
4490 5510 10 0.72 0.694 1.062 0 0
.25 .50 .75 .90 .95
0 0 1 2 3
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 2913 817 350 201 112 58 24 8 5 2
Proportion 0.649 0.182 0.078 0.045 0.025 0.013 0.005 0.002 0.001 0.000
--------------------------------------------------------------------------------
PUBDOITY
n missing distinct Info Mean Gmd
4490 5510 9 0.362 0.2359 0.4265
lowest : 0 1 2 3 4, highest: 4 5 6 7 8
Value 0 1 2 3 4 5 6 7 8
Frequency 3864 398 134 36 27 19 6 2 4
Proportion 0.861 0.089 0.030 0.008 0.006 0.004 0.001 0.000 0.001
--------------------------------------------------------------------------------
PUBNEWFN
n missing distinct Info Mean Gmd .05 .10
4490 5510 10 0.509 0.3737 0.6393 0 0
.25 .50 .75 .90 .95
0 0 0 1 2
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 3538 569 216 78 44 25 8 6 2 4
Proportion 0.788 0.127 0.048 0.017 0.010 0.006 0.002 0.001 0.000 0.001
--------------------------------------------------------------------------------
PUBPHOTO
n missing distinct Info Sum Mean Gmd
4490 5510 2 0.015 23 0.005122 0.01019
--------------------------------------------------------------------------------
PUBOPP
n missing distinct Info Mean Gmd .05 .10
4490 5510 10 0.337 0.245 0.4506 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 3914 364 91 68 11 7 12 3 2 18
Proportion 0.872 0.081 0.020 0.015 0.002 0.002 0.003 0.001 0.000 0.004
--------------------------------------------------------------------------------
DATASRCE
n missing distinct Info Mean Gmd
7821 2179 3 0.766 2.487 0.6647
Value 1 2 3
Frequency 780 2455 4586
Proportion 0.100 0.314 0.586
--------------------------------------------------------------------------------
MALEMILI
n missing distinct Info Mean Gmd .05 .10
10000 0 58 0.533 1.034 1.894 0 0
.25 .50 .75 .90 .95
0 0 0 2 4
lowest : 0 1 2 3 4, highest: 92 93 94 96 97
--------------------------------------------------------------------------------
MALEVET
n missing distinct Info Mean Gmd .05 .10
10000 0 81 0.999 30.56 12.56 10 17
.25 .50 .75 .90 .95
25 31 37 44 49
lowest : 0 1 2 3 4, highest: 78 79 82 84 85
--------------------------------------------------------------------------------
VIETVETS
n missing distinct Info Mean Gmd .05 .10
10000 0 91 0.999 29.84 16.82 0 11
.25 .50 .75 .90 .95
21 30 39 49 55
lowest : 0 1 2 3 4, highest: 88 90 91 93 99
--------------------------------------------------------------------------------
WWIIVETS
n missing distinct Info Mean Gmd .05 .10
10000 0 98 1 32.56 19.64 0 10
.25 .50 .75 .90 .95
21 32 43 55 63
lowest : 0 1 2 3 4, highest: 94 95 96 97 99
--------------------------------------------------------------------------------
LOCALGOV
n missing distinct Info Mean Gmd .05 .10
10000 0 37 0.993 6.867 4.62 0 2
.25 .50 .75 .90 .95
4 6 9 12 15
lowest : 0 1 2 3 4, highest: 34 38 43 44 57
--------------------------------------------------------------------------------
STATEGOV
n missing distinct Info Mean Gmd .05 .10
10000 0 48 0.988 4.592 4.78 0 0
.25 .50 .75 .90 .95
1 3 6 10 14
lowest : 0 1 2 3 4, highest: 45 46 48 50 58
--------------------------------------------------------------------------------
FEDGOV
n missing distinct Info Mean Gmd .05 .10
10000 0 47 0.975 3.182 3.623 0 0
.25 .50 .75 .90 .95
1 2 4 7 10
lowest : 0 1 2 3 4, highest: 44 46 47 49 50
--------------------------------------------------------------------------------
SOLP3
n missing distinct Info Mean Gmd
18 9982 3 0.793 4.111 5.699
Value 0 1 12
Frequency 10 2 6
Proportion 0.556 0.111 0.333
--------------------------------------------------------------------------------
SOLIH
n missing distinct Info Mean Gmd
650 9350 5 0.177 11.3 1.321
lowest : 0 1 2 4 12, highest: 0 1 2 4 12
Value 0 1 2 4 12
Frequency 23 2 15 1 609
Proportion 0.035 0.003 0.023 0.002 0.937
--------------------------------------------------------------------------------
MAJOR
n missing distinct value
20 9980 1 X
Value X
Frequency 20
Proportion 1
--------------------------------------------------------------------------------
WEALTH2
n missing distinct Info Mean Gmd .05 .10
5423 4577 10 0.989 4.982 3.2 0 1
.25 .50 .75 .90 .95
3 5 7 9 9
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 333 419 511 554 552 573 552 613 604 712
Proportion 0.061 0.077 0.094 0.102 0.102 0.106 0.102 0.113 0.111 0.131
--------------------------------------------------------------------------------
GEOCODE
n missing distinct Info Mean Gmd
1556 8444 7 0.967 6.77 5.088
lowest : 1 2 3 4 5, highest: 3 4 5 12 14
Value 1 2 3 4 5 12 14
Frequency 112 136 339 182 203 393 191
Proportion 0.072 0.087 0.218 0.117 0.130 0.253 0.123
--------------------------------------------------------------------------------
COLLECT1
n missing distinct value
585 9415 1 Y
Value Y
Frequency 585
Proportion 1
--------------------------------------------------------------------------------
VETERANS
n missing distinct value
1085 8915 1 Y
Value Y
Frequency 1085
Proportion 1
--------------------------------------------------------------------------------
BIBLE
n missing distinct value
988 9012 1 Y
Value Y
Frequency 988
Proportion 1
--------------------------------------------------------------------------------
CATLG
n missing distinct value
850 9150 1 Y
Value Y
Frequency 850
Proportion 1
--------------------------------------------------------------------------------
HOMEE
n missing distinct value
102 9898 1 Y
Value Y
Frequency 102
Proportion 1
--------------------------------------------------------------------------------
PETS
n missing distinct value
1541 8459 1 Y
Value Y
Frequency 1541
Proportion 1
--------------------------------------------------------------------------------
CDPLAY
n missing distinct value
1325 8675 1 Y
Value Y
Frequency 1325
Proportion 1
--------------------------------------------------------------------------------
STEREO
n missing distinct value
1371 8629 1 Y
Value Y
Frequency 1371
Proportion 1
--------------------------------------------------------------------------------
PCOWNERS
n missing distinct value
1122 8878 1 Y
Value Y
Frequency 1122
Proportion 1
--------------------------------------------------------------------------------
PHOTO
n missing distinct value
523 9477 1 Y
Value Y
Frequency 523
Proportion 1
--------------------------------------------------------------------------------
CRAFTS
n missing distinct value
919 9081 1 Y
Value Y
Frequency 919
Proportion 1
--------------------------------------------------------------------------------
FISHER
n missing distinct value
762 9238 1 Y
Value Y
Frequency 762
Proportion 1
--------------------------------------------------------------------------------
GARDENIN
n missing distinct value
1387 8613 1 Y
Value Y
Frequency 1387
Proportion 1
--------------------------------------------------------------------------------
BOATS
n missing distinct value
208 9792 1 Y
Value Y
Frequency 208
Proportion 1
--------------------------------------------------------------------------------
WALKER
n missing distinct value
1054 8946 1 Y
Value Y
Frequency 1054
Proportion 1
--------------------------------------------------------------------------------
KIDSTUFF
n missing distinct value
160 9840 1 Y
Value Y
Frequency 160
Proportion 1
--------------------------------------------------------------------------------
CARDS
n missing distinct value
104 9896 1 Y
Value Y
Frequency 104
Proportion 1
--------------------------------------------------------------------------------
PLATES
n missing distinct value
47 9953 1 Y
Value Y
Frequency 47
Proportion 1
--------------------------------------------------------------------------------
LIFESRC
n missing distinct Info Mean Gmd
4398 5602 3 0.855 2.038 0.7708
Value 1 2 3
Frequency 1066 2100 1232
Proportion 0.242 0.477 0.280
--------------------------------------------------------------------------------
PEPSTRFL
n missing distinct value
4653 5347 1 X
Value X
Frequency 4653
Proportion 1
--------------------------------------------------------------------------------
POP901
n missing distinct Info Mean Gmd .05 .10
10000 0 4320 1 3279 3790 529 691
.25 .50 .75 .90 .95
987 1566 3046 6303 10811
lowest : 0 3 4 6 10, highest: 80599 80680 90832 96371 98701
--------------------------------------------------------------------------------
POP902
n missing distinct Info Mean Gmd .05 .10
10000 0 2259 1 869.6 1000 132 180
.25 .50 .75 .90 .95
263 422 824 1723 2946
lowest : 0 1 2 3 5, highest: 19094 20063 20621 21703 23766
--------------------------------------------------------------------------------
POP903
n missing distinct Info Mean Gmd .05 .10
10000 0 2659 1 1229 1413 203 269
.25 .50 .75 .90 .95
374 586 1142 2381 4032
lowest : 0 1 2 3 4, highest: 27674 28633 29306 30492 31167
--------------------------------------------------------------------------------
POP90C1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.796 58.98 47.6 0 0
.25 .50 .75 .90 .95
0 99 99 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
POP90C2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.484 13.5 23.11 0 0
.25 .50 .75 .90 .95
0 0 0 83 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
POP90C3
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.779 25.84 37.63 0 0
.25 .50 .75 .90 .95
0 0 52 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
POP90C4
n missing distinct Info Mean Gmd .05 .10
10000 0 59 0.981 48.19 3.969 43 45
.25 .50 .75 .90 .95
47 49 50 51 52
lowest : 0 14 19 21 27, highest: 79 87 89 91 99
--------------------------------------------------------------------------------
POP90C5
n missing distinct Info Mean Gmd .05 .10
10000 0 58 0.981 50.92 4.018 47 49
.25 .50 .75 .90 .95
50 51 53 55 56
lowest : 0 9 11 13 21, highest: 72 73 79 81 86
--------------------------------------------------------------------------------
ETH1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.994 84.79 18.75 36 60
.25 .50 .75 .90 .95
82 93 98 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
ETH2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.958 7.57 11.93 0 0
.25 .50 .75 .90 .95
0 1 6 21 40
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
ETH3
n missing distinct Info Mean Gmd .05 .10
10000 0 43 0.715 0.7731 1.265 0 0
.25 .50 .75 .90 .95
0 0 1 1 2
lowest : 0 1 2 3 4, highest: 92 95 96 98 99
--------------------------------------------------------------------------------
ETH4
n missing distinct Info Mean Gmd .05 .10
10000 0 74 0.924 2.827 4.27 0 0
.25 .50 .75 .90 .95
0 1 2 7 12
lowest : 0 1 2 3 4, highest: 85 87 89 90 91
--------------------------------------------------------------------------------
ETH5
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.979 7.584 10.75 0.00 0.00
.25 .50 .75 .90 .95
1.00 2.00 7.00 20.00 35.05
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
ETH6
n missing distinct Info Mean Gmd .05 .10
10000 0 12 0.395 0.2157 0.3806 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 7 8 9 10 22
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 8450 1200 226 71 28 8 6 4 2 2 2
Proportion 0.845 0.120 0.023 0.007 0.003 0.001 0.001 0.000 0.000 0.000 0.000
Value 22
Frequency 1
Proportion 0.000
--------------------------------------------------------------------------------
ETH7
n missing distinct Info Mean Gmd .05 .10
10000 0 36 0.428 0.3643 0.6632 0 0
.25 .50 .75 .90 .95
0 0 0 1 2
lowest : 0 1 2 3 4, highest: 46 53 55 57 59
--------------------------------------------------------------------------------
ETH8
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.523 0.594 1.065 0 0
.25 .50 .75 .90 .95
0 0 0 1 2
lowest : 0 1 2 3 4, highest: 36 38 40 45 61
--------------------------------------------------------------------------------
ETH9
n missing distinct Info Mean Gmd .05 .10
10000 0 35 0.492 0.5407 0.9726 0 0
.25 .50 .75 .90 .95
0 0 0 1 3
lowest : 0 1 2 3 4, highest: 42 43 48 56 58
--------------------------------------------------------------------------------
ETH10
n missing distinct Info Mean Gmd .05 .10
10000 0 18 0.362 0.2301 0.4171 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 14 17 18 22 28
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 8602 1000 228 74 32 24 14 7 6 2 3
Proportion 0.860 0.100 0.023 0.007 0.003 0.002 0.001 0.001 0.001 0.000 0.000
Value 12 13 14 17 18 22 28
Frequency 1 1 1 1 1 2 1
Proportion 0.000 0.000 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
ETH11
n missing distinct Info Mean Gmd .05 .10
10000 0 21 0.29 0.2146 0.4019 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 16 17 22 30 45
--------------------------------------------------------------------------------
ETH12
n missing distinct Info Mean Gmd .05 .10
10000 0 21 0.034 0.0566 0.1128 0 0
.25 .50 .75 .90 .95
0 0 0 0 0
lowest : 0 1 2 3 5, highest: 19 23 42 66 72
--------------------------------------------------------------------------------
ETH13
n missing distinct Info Mean Gmd .05 .10
10000 0 94 0.944 5.217 8.088 0 0
.25 .50 .75 .90 .95
0 1 4 14 26
lowest : 0 1 2 3 4, highest: 90 91 92 94 95
--------------------------------------------------------------------------------
ETH14
n missing distinct Info Mean Gmd .05 .10
10000 0 20 0.414 0.2821 0.505 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 16 17 20 26 28
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 8358 1225 186 91 51 32 11 11 8 2 6
Proportion 0.836 0.122 0.019 0.009 0.005 0.003 0.001 0.001 0.001 0.000 0.001
Value 11 12 13 14 16 17 20 26 28
Frequency 4 4 1 1 2 1 4 1 1
Proportion 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
ETH15
n missing distinct Info Mean Gmd .05 .10
10000 0 46 0.217 0.387 0.7573 0 0
.25 .50 .75 .90 .95
0 0 0 0 1
lowest : 0 1 2 3 4, highest: 68 71 72 77 78
--------------------------------------------------------------------------------
ETH16
n missing distinct Info Mean Gmd .05 .10
10000 0 45 0.887 1.515 2.175 0 0
.25 .50 .75 .90 .95
0 1 2 4 5
lowest : 0 1 2 3 4, highest: 47 48 52 61 68
--------------------------------------------------------------------------------
AGE901
n missing distinct Info Mean Gmd .05 .10
10000 0 63 0.997 34.39 7.982 26 27
.25 .50 .75 .90 .95
30 33 37 42 48
lowest : 0 14 17 19 20, highest: 74 75 76 77 79
--------------------------------------------------------------------------------
AGE902
n missing distinct Info Mean Gmd .05 .10
10000 0 61 0.997 41.84 8.188 32 34
.25 .50 .75 .90 .95
38 41 45 51 56
lowest : 0 19 20 21 22, highest: 74 75 76 77 80
--------------------------------------------------------------------------------
AGE903
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.997 45.07 8.04 36 38
.25 .50 .75 .90 .95
41 44 48 54 59
lowest : 0 28 29 30 31, highest: 75 76 77 78 80
--------------------------------------------------------------------------------
AGE904
n missing distinct Info Mean Gmd .05 .10
10000 0 59 0.996 35.86 7.085 28 29
.25 .50 .75 .90 .95
32 35 39 43 47
lowest : 0 19 20 21 22, highest: 72 73 74 75 76
--------------------------------------------------------------------------------
AGE905
n missing distinct Info Mean Gmd .05 .10
10000 0 56 0.996 44.65 6.628 37 39
.25 .50 .75 .90 .95
41 45 48 51 54
lowest : 0 21 22 24 25, highest: 72 73 74 75 76
--------------------------------------------------------------------------------
AGE906
n missing distinct Info Mean Gmd .05 .10
10000 0 47 0.996 47.86 6.397 40 42
.25 .50 .75 .90 .95
45 48 51 54 57
lowest : 0 31 32 33 34, highest: 72 73 74 75 76
--------------------------------------------------------------------------------
AGE907
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.998 24.55 8.053 11 15
.25 .50 .75 .90 .95
21 26 29 33 35
lowest : 0 1 2 3 4, highest: 49 50 51 54 55
--------------------------------------------------------------------------------
CHIL1
n missing distinct Info Mean Gmd .05 .10
10000 0 66 0.998 39.56 8.412 29 32
.25 .50 .75 .90 .95
35 39 44 49 52
lowest : 0 11 13 15 16, highest: 75 78 83 86 99
--------------------------------------------------------------------------------
CHIL2
n missing distinct Info Mean Gmd .05 .10
10000 0 56 0.994 38.35 5.833 30 33
.25 .50 .75 .90 .95
36 39 42 44 45
lowest : 0 4 7 8 9, highest: 57 62 67 80 99
--------------------------------------------------------------------------------
CHIL3
n missing distinct Info Mean Gmd .05 .10
10000 0 56 0.995 21 5.582 13 15
.25 .50 .75 .90 .95
18 21 24 26 28
lowest : 0 3 4 5 6, highest: 67 68 73 75 99
--------------------------------------------------------------------------------
AGEC1
n missing distinct Info Mean Gmd .05 .10
10000 0 78 0.994 12.27 5.609 6 7
.25 .50 .75 .90 .95
9 11 14 18 21
lowest : 0 1 2 3 4, highest: 80 81 85 87 93
--------------------------------------------------------------------------------
AGEC2
n missing distinct Info Mean Gmd .05 .10
10000 0 60 0.998 22.17 8.371 11 13
.25 .50 .75 .90 .95
18 22 26 32 35
lowest : 0 1 2 3 4, highest: 55 56 58 59 60
--------------------------------------------------------------------------------
AGEC3
n missing distinct Info Mean Gmd .05 .10
10000 0 51 0.996 20.62 6.576 12 14
.25 .50 .75 .90 .95
17 20 24 28 31
lowest : 0 1 2 3 4, highest: 46 47 48 50 63
--------------------------------------------------------------------------------
AGEC4
n missing distinct Info Mean Gmd .05 .10
10000 0 36 0.993 14.05 4.66 8 9
.25 .50 .75 .90 .95
12 14 16 19 21
lowest : 0 1 2 3 4, highest: 31 32 33 34 37
--------------------------------------------------------------------------------
AGEC5
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.993 11.85 4.536 5 7
.25 .50 .75 .90 .95
9 12 14 17 19
lowest : 0 1 2 3 4, highest: 33 34 36 39 40
--------------------------------------------------------------------------------
AGEC6
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.996 10.49 6.025 3 4
.25 .50 .75 .90 .95
7 10 13 17 20
lowest : 0 1 2 3 4, highest: 49 50 52 54 61
--------------------------------------------------------------------------------
AGEC7
n missing distinct Info Mean Gmd .05 .10
10000 0 63 0.995 7.682 6.339 1 2
.25 .50 .75 .90 .95
3 6 10 15 19
lowest : 0 1 2 3 4, highest: 60 62 63 67 68
--------------------------------------------------------------------------------
CHILC1
n missing distinct Info Mean Gmd .05 .10
10000 0 45 0.994 16.15 5.19 10 11
.25 .50 .75 .90 .95
13 16 19 22 24
lowest : 0 1 3 4 5, highest: 42 44 50 60 99
--------------------------------------------------------------------------------
CHILC2
n missing distinct Info Mean Gmd .05 .10
10000 0 37 0.988 16.13 3.69 11 13
.25 .50 .75 .90 .95
14 16 18 20 21
lowest : 0 1 2 3 4, highest: 33 37 42 44 99
--------------------------------------------------------------------------------
CHILC3
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.991 31.63 4.882 25 27
.25 .50 .75 .90 .95
30 32 34 36 38
lowest : 0 1 2 4 5, highest: 54 56 57 67 99
--------------------------------------------------------------------------------
CHILC4
n missing distinct Info Mean Gmd .05 .10
10000 0 41 0.993 19.76 4.611 13 15
.25 .50 .75 .90 .95
18 20 22 24 26
lowest : 0 2 3 4 5, highest: 38 39 43 50 99
--------------------------------------------------------------------------------
CHILC5
n missing distinct Info Mean Gmd .05 .10
10000 0 70 0.991 15.24 4.781 9 11
.25 .50 .75 .90 .95
13 15 17 20 22
lowest : 0 1 2 3 4, highest: 83 89 90 97 99
--------------------------------------------------------------------------------
HHAGE1
n missing distinct Info Mean Gmd .05 .10
10000 0 98 0.999 24.51 13.82 7 10
.25 .50 .75 .90 .95
16 23 31 39 46
lowest : 0 1 2 3 4, highest: 93 94 95 96 99
--------------------------------------------------------------------------------
HHAGE2
n missing distinct Info Mean Gmd .05 .10
10000 0 69 0.997 9.393 7.474 1 2
.25 .50 .75 .90 .95
4 8 12 18 22
lowest : 0 1 2 3 4, highest: 69 70 89 91 99
--------------------------------------------------------------------------------
HHAGE3
n missing distinct Info Mean Gmd .05 .10
10000 0 97 0.999 22.21 13.64 5 7
.25 .50 .75 .90 .95
13 21 29 37 43
lowest : 0 1 2 3 4, highest: 93 94 95 96 99
--------------------------------------------------------------------------------
HHN1
n missing distinct Info Mean Gmd .05 .10
10000 0 92 0.999 22.89 12.54 8 10
.25 .50 .75 .90 .95
15 21 28 38 44
lowest : 0 1 2 3 4, highest: 90 91 93 95 99
--------------------------------------------------------------------------------
HHN2
n missing distinct Info Mean Gmd .05 .10
10000 0 74 0.998 33.5 8.479 22 25
.25 .50 .75 .90 .95
29 33 37 42 46
lowest : 0 5 7 9 10, highest: 76 80 81 82 85
--------------------------------------------------------------------------------
HHN3
n missing distinct Info Mean Gmd .05 .10
10000 0 90 0.999 42.7 16.09 16 24
.25 .50 .75 .90 .95
34 44 52 60 65
lowest : 0 1 2 3 4, highest: 85 86 87 89 99
--------------------------------------------------------------------------------
HHN4
n missing distinct Info Mean Gmd .05 .10
10000 0 75 0.999 25.6 12.23 7 12
.25 .50 .75 .90 .95
19 25 32 39 44
lowest : 0 1 2 3 4, highest: 70 71 72 76 78
--------------------------------------------------------------------------------
HHN5
n missing distinct Info Mean Gmd .05 .10
10000 0 56 0.996 10.49 6.538 2 4
.25 .50 .75 .90 .95
7 10 13 18 21
lowest : 0 1 2 3 4, highest: 54 55 56 59 61
--------------------------------------------------------------------------------
HHN6
n missing distinct Info Mean Gmd .05 .10
10000 0 39 0.977 3.916 3.309 0 1
.25 .50 .75 .90 .95
2 3 5 7 10
lowest : 0 1 2 3 4, highest: 36 40 41 43 44
--------------------------------------------------------------------------------
MARR1
n missing distinct Info Mean Gmd .05 .10
10000 0 86 0.999 57.9 13.66 33.95 42.00
.25 .50 .75 .90 .95
52.00 61.00 67.00 71.00 73.00
lowest : 0 2 3 5 7, highest: 85 87 88 90 99
--------------------------------------------------------------------------------
MARR2
n missing distinct Info Mean Gmd .05 .10
10000 0 39 0.994 10.76 4.928 4 6
.25 .50 .75 .90 .95
8 10 13 17 19
lowest : 0 1 2 3 4, highest: 35 37 44 50 52
--------------------------------------------------------------------------------
MARR3
n missing distinct Info Mean Gmd .05 .10
10000 0 47 0.993 7.449 4.908 2 3
.25 .50 .75 .90 .95
4 7 9 13 16
lowest : 0 1 2 3 4, highest: 43 44 62 67 73
--------------------------------------------------------------------------------
MARR4
n missing distinct Info Mean Gmd .05 .10
10000 0 84 0.997 23.01 8.993 13 15
.25 .50 .75 .90 .95
18 21 26 34 39
lowest : 0 2 3 4 5, highest: 88 89 93 94 97
--------------------------------------------------------------------------------
HHP1
n missing distinct Info Mean Gmd .05 .10
10000 0 326 1 185 53.59 110 132
.25 .50 .75 .90 .95
158 181 211 247 271
lowest : 0 50 53 54 55, highest: 428 445 446 456 460
--------------------------------------------------------------------------------
HHP2
n missing distinct Info Mean Gmd .05 .10
10000 0 318 1 259.5 51.98 183 206
.25 .50 .75 .90 .95
235 262 288 313 331
lowest : 0 100 105 107 109, highest: 468 469 476 490 494
--------------------------------------------------------------------------------
DW1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 69.69 27.3 15 33
.25 .50 .75 .90 .95
57 75 89 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
DW2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 65.82 29.03 10 25
.25 .50 .75 .90 .95
51 71 86 97 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
DW3
n missing distinct Info Mean Gmd .05 .10
10000 0 55 0.935 2.898 4.129 0 0
.25 .50 .75 .90 .95
0 1 3 8 12
lowest : 0 1 2 3 4, highest: 55 56 61 65 66
--------------------------------------------------------------------------------
DW4
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.994 19.59 24.12 0 0
.25 .50 .75 .90 .95
1 10 30 56 74
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
DW5
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.982 16.67 22.06 0 0
.25 .50 .75 .90 .95
0 6 25 51 69
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
DW6
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.955 13.18 18.78 0 0
.25 .50 .75 .90 .95
0 3 19 42 61
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
DW7
n missing distinct Info Mean Gmd .05 .10
10000 0 73 0.657 1.861 3.274 0 0
.25 .50 .75 .90 .95
0 0 1 5 10
lowest : 0 1 2 3 4, highest: 88 92 93 97 99
--------------------------------------------------------------------------------
DW8
n missing distinct Info Mean Gmd .05 .10
10000 0 55 0.523 1.198 2.167 0 0
.25 .50 .75 .90 .95
0 0 0 3 7
lowest : 0 1 2 3 4, highest: 73 75 77 83 88
--------------------------------------------------------------------------------
DW9
n missing distinct Info Mean Gmd .05 .10
10000 0 59 0.329 0.6562 1.258 0 0
.25 .50 .75 .90 .95
0 0 0 1 2
lowest : 0 1 2 3 4, highest: 71 77 92 97 99
--------------------------------------------------------------------------------
HV1
n missing distinct Info Mean Gmd .05 .10
10000 0 2453 1 1048 842.4 300.0 372.0
.25 .50 .75 .90 .95
515.8 736.0 1188.0 2159.6 2979.0
lowest : 0 75 98 109 120, highest: 5970 5976 5980 5983 5985
--------------------------------------------------------------------------------
HV2
n missing distinct Info Mean Gmd .05 .10
10000 0 2573 1 1117 872.3 334 409
.25 .50 .75 .90 .95
558 804 1300 2280 3137
lowest : 0 90 125 137 143, highest: 5928 5932 5933 5950 5955
--------------------------------------------------------------------------------
HV3
n missing distinct Info Mean Gmd .05 .10
10000 0 14 0.972 4.197 2.418 2 2
.25 .50 .75 .90 .95
3 4 5 7 9
lowest : 0 1 2 3 4, highest: 9 10 11 12 13
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 98 342 1863 2267 1912 1282 829 524 296 238 105
Proportion 0.010 0.034 0.186 0.227 0.191 0.128 0.083 0.052 0.030 0.024 0.011
Value 11 12 13
Frequency 101 125 18
Proportion 0.010 0.013 0.002
--------------------------------------------------------------------------------
HV4
n missing distinct Info Mean Gmd .05 .10
10000 0 14 0.977 3.86 2.426 1 1
.25 .50 .75 .90 .95
2 3 5 7 8
lowest : 0 1 2 3 4, highest: 9 10 11 12 13
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 136 1009 1967 1941 1715 1185 767 526 342 200 129
Proportion 0.014 0.101 0.197 0.194 0.172 0.118 0.077 0.053 0.034 0.020 0.013
Value 11 12 13
Frequency 62 19 2
Proportion 0.006 0.002 0.000
--------------------------------------------------------------------------------
HU1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 69.35 23.33 23 37
.25 .50 .75 .90 .95
59 75 85 92 95
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HU2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 29.73 22.59 5 8
.25 .50 .75 .90 .95
15 24 40 61 74
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HU3
n missing distinct Info Mean Gmd .05 .10
10000 0 82 0.995 89.89 10.26 69 80
.25 .50 .75 .90 .95
89 94 96 98 99
lowest : 0 17 18 20 21, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HU4
n missing distinct Info Mean Gmd .05 .10
10000 0 83 0.995 9.206 8.82 1 2
.25 .50 .75 .90 .95
3 6 11 19 28
lowest : 0 1 2 3 4, highest: 79 80 82 83 99
--------------------------------------------------------------------------------
HU5
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.968 13.48 18.94 0 0
.25 .50 .75 .90 .95
0 5 15 46 65
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HHD1
n missing distinct Info Mean Gmd .05 .10
10000 0 87 0.999 35.7 14.38 12 19
.25 .50 .75 .90 .95
28 36 44 51 56
lowest : 0 1 2 3 4, highest: 83 84 85 86 99
--------------------------------------------------------------------------------
HHD2
n missing distinct Info Mean Gmd .05 .10
10000 0 95 0.999 71.4 15.58 43 53
.25 .50 .75 .90 .95
65 75 81 86 89
lowest : 0 5 7 8 9, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HHD3
n missing distinct Info Mean Gmd .05 .10
10000 0 94 1 58.55 17.88 29 37
.25 .50 .75 .90 .95
49 61 70 77 81
lowest : 0 4 5 6 7, highest: 92 93 94 95 99
--------------------------------------------------------------------------------
HHD4
n missing distinct Info Mean Gmd .05 .10
10000 0 78 0.999 27.73 13.25 8 13
.25 .50 .75 .90 .95
20 28 35 43 48
lowest : 0 1 2 3 4, highest: 75 76 77 80 81
--------------------------------------------------------------------------------
HHD5
n missing distinct Info Mean Gmd .05 .10
10000 0 94 0.998 81.8 12.94 56 67
.25 .50 .75 .90 .95
79 86 90 93 94
lowest : 0 2 3 8 9, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HHD6
n missing distinct Info Mean Gmd .05 .10
10000 0 95 0.998 17.31 11.79 5 7
.25 .50 .75 .90 .95
10 14 21 31 41
lowest : 0 1 2 3 4, highest: 91 92 97 98 99
--------------------------------------------------------------------------------
HHD7
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.993 7.973 5.264 2 3
.25 .50 .75 .90 .95
5 7 10 14 17
lowest : 0 1 2 3 4, highest: 48 50 57 66 99
--------------------------------------------------------------------------------
HHD8
n missing distinct Info Mean Gmd .05 .10
10000 0 13 0.898 1.641 1.126 0 0
.25 .50 .75 .90 .95
1 1 2 3 4
lowest : 0 1 2 3 4, highest: 8 9 10 11 12
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 1046 3993 3306 1125 351 104 44 15 9 4 1
Proportion 0.105 0.399 0.331 0.112 0.035 0.010 0.004 0.002 0.001 0.000 0.000
Value 11 12
Frequency 1 1
Proportion 0.000 0.000
--------------------------------------------------------------------------------
HHD9
n missing distinct Info Mean Gmd .05 .10
10000 0 49 0.99 6.334 4.504 2 2
.25 .50 .75 .90 .95
4 5 8 11 15
lowest : 0 1 2 3 4, highest: 45 48 55 62 99
--------------------------------------------------------------------------------
HHD10
n missing distinct Info Mean Gmd .05 .10
10000 0 62 0.996 13.65 7.141 5 7
.25 .50 .75 .90 .95
9 12 16 22 27
lowest : 0 1 2 3 4, highest: 60 62 64 67 99
--------------------------------------------------------------------------------
HHD11
n missing distinct Info Mean Gmd .05 .10
10000 0 72 0.999 18.92 10.16 6 9
.25 .50 .75 .90 .95
12 18 24 31 36
lowest : 0 1 2 3 4, highest: 69 71 78 85 99
--------------------------------------------------------------------------------
HHD12
n missing distinct Info Mean Gmd .05 .10
10000 0 48 0.983 4.804 3.806 1 1
.25 .50 .75 .90 .95
2 4 6 9 12
lowest : 0 1 2 3 4, highest: 50 53 56 58 67
--------------------------------------------------------------------------------
ETHC1
n missing distinct Info Mean Gmd .05 .10
10000 0 42 0.998 16.76 7.561 3 7
.25 .50 .75 .90 .95
13 18 22 25 26
lowest : 0 1 2 3 4, highest: 37 38 39 41 46
--------------------------------------------------------------------------------
ETHC2
n missing distinct Info Mean Gmd .05 .10
10000 0 90 0.999 50.74 14.63 18 33
.25 .50 .75 .90 .95
46 54 60 64 66
lowest : 0 1 2 3 4, highest: 85 86 89 90 92
--------------------------------------------------------------------------------
ETHC3
n missing distinct Info Mean Gmd .05 .10
10000 0 97 0.999 17.32 12.12 3 5
.25 .50 .75 .90 .95
10 15 22 30 38
lowest : 0 1 2 3 4, highest: 93 94 95 96 97
--------------------------------------------------------------------------------
ETHC4
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.812 1.993 3.254 0 0
.25 .50 .75 .90 .95
0 0 1 6 11
lowest : 0 1 2 3 4, highest: 33 34 35 37 39
--------------------------------------------------------------------------------
ETHC5
n missing distinct Info Mean Gmd .05 .10
10000 0 72 0.929 4.653 7.331 0 0
.25 .50 .75 .90 .95
0 1 4 13 25
lowest : 0 1 2 3 4, highest: 67 68 71 75 83
--------------------------------------------------------------------------------
ETHC6
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.515 0.8284 1.495 0 0
.25 .50 .75 .90 .95
0 0 0 2 5
lowest : 0 1 2 3 4, highest: 40 42 43 54 81
--------------------------------------------------------------------------------
HVP1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.922 13.13 21.19 0 0
.25 .50 .75 .90 .95
0 1 9 59 88
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HVP2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.981 20.57 29.6 0 0
.25 .50 .75 .90 .95
0 3 26 85 96
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HVP3
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.998 34.77 39.41 0 1
.25 .50 .75 .90 .95
4 17 67 97 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HVP4
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.998 51.01 41.67 1 4
.25 .50 .75 .90 .95
15 48 92 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HVP5
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.979 73.92 31.36 14 25
.25 .50 .75 .90 .95
53 89 99 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HVP6
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.724 6.102 10.88 0 0
.25 .50 .75 .90 .95
0 0 1 16 49
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HUR1
n missing distinct Info Mean Gmd .05 .10
10000 0 72 0.983 4.94 6.197 0 0
.25 .50 .75 .90 .95
1 2 6 12 19
lowest : 0 1 2 3 4, highest: 76 78 89 95 99
--------------------------------------------------------------------------------
HUR2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 45.55 24.32 11 18
.25 .50 .75 .90 .95
31 43 59 76 85
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
RHP1
n missing distinct Info Mean Gmd .05 .10
10000 0 72 0.999 52.84 11.31 37 42
.25 .50 .75 .90 .95
48 52 58 66 71
lowest : 0 5 8 11 13, highest: 80 81 82 83 84
--------------------------------------------------------------------------------
RHP2
n missing distinct Info Mean Gmd .05 .10
10000 0 74 0.999 53.64 10.9 38 43
.25 .50 .75 .90 .95
49 54 59 66 70
lowest : 0 10 13 14 15, highest: 83 84 85 86 88
--------------------------------------------------------------------------------
RHP3
n missing distinct Info Mean Gmd .05 .10
10000 0 20 0.975 14.13 2.567 11 12
.25 .50 .75 .90 .95
13 14 15 17 18
lowest : 0 10 11 12 13, highest: 24 25 26 29 30
Value 0 10 11 12 13 14 15 16 17 18 19
Frequency 89 406 442 939 1822 2229 1808 1068 525 283 137
Proportion 0.009 0.041 0.044 0.094 0.182 0.223 0.181 0.107 0.052 0.028 0.014
Value 20 21 22 23 24 25 26 29 30
Frequency 103 71 37 18 10 9 2 1 1
Proportion 0.010 0.007 0.004 0.002 0.001 0.001 0.000 0.000 0.000
--------------------------------------------------------------------------------
RHP4
n missing distinct Info Mean Gmd .05 .10
10000 0 16 0.859 4.399 1.095 3 3
.25 .50 .75 .90 .95
4 4 5 5 6
lowest : 0 1 2 3 4, highest: 11 12 13 14 15
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 92 27 120 979 4784 3124 517 171 88 36 24
Proportion 0.009 0.003 0.012 0.098 0.478 0.312 0.052 0.017 0.009 0.004 0.002
Value 11 12 13 14 15
Frequency 17 8 7 4 2
Proportion 0.002 0.001 0.001 0.000 0.000
--------------------------------------------------------------------------------
HUPA1
n missing distinct Info Mean Gmd .05 .10
10000 0 90 0.991 10.25 12.42 0 0
.25 .50 .75 .90 .95
1 6 15 27 36
lowest : 0 1 2 3 4, highest: 90 91 92 93 99
--------------------------------------------------------------------------------
HUPA2
n missing distinct Info Mean Gmd .05 .10
10000 0 99 0.897 9.324 14.33 0 0
.25 .50 .75 .90 .95
0 1 11 31 48
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HUPA3
n missing distinct Info Mean Gmd .05 .10
10000 0 99 0.868 8.782 13.1 0 0
.25 .50 .75 .90 .95
0 0 13 28 38
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HUPA4
n missing distinct Info Mean Gmd .05 .10
10000 0 70 0.998 11.17 8.028 2 3
.25 .50 .75 .90 .95
6 10 15 21 25
lowest : 0 1 2 3 4, highest: 82 89 92 98 99
--------------------------------------------------------------------------------
HUPA5
n missing distinct Info Mean Gmd .05 .10
10000 0 60 0.977 5.288 6.829 0 0
.25 .50 .75 .90 .95
0 2 7 14 20
lowest : 0 1 2 3 4, highest: 60 67 69 78 99
--------------------------------------------------------------------------------
HUPA6
n missing distinct Info Mean Gmd .05 .10
10000 0 99 0.944 11.18 16.17 0 0
.25 .50 .75 .90 .95
0 3 15 36 53
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HUPA7
n missing distinct Info Mean Gmd .05 .10
10000 0 37 0.799 1.641 2.493 0 0
.25 .50 .75 .90 .95
0 0 2 5 7
lowest : 0 1 2 3 4, highest: 35 37 38 41 99
--------------------------------------------------------------------------------
RP1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.995 28.78 34.3 0.00 0.00
.25 .50 .75 .90 .95
2.00 13.00 53.25 83.00 92.00
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
RP2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 42.16 39.54 0 1
.25 .50 .75 .90 .95
7 35 77 93 96
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
RP3
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 59.65 37.19 4 9
.25 .50 .75 .90 .95
29 69 91 97 98
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
RP4
n missing distinct Info Mean Gmd .05 .10
10000 0 99 0.999 76.45 25.54 23 37
.25 .50 .75 .90 .95
64 86 95 98 99
lowest : 0 2 3 4 5, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
MSA
n missing distinct Info Mean Gmd .05 .10
9989 11 257 0.989 3544 3269 0 0
.25 .50 .75 .90 .95
520 3360 5960 7480 8280
lowest : 0 40 120 200 220, highest: 9200 9260 9270 9340 9360
--------------------------------------------------------------------------------
ADI
n missing distinct Info Mean Gmd .05 .10
9989 11 183 0.999 189.2 155.1 13 15
.25 .50 .75 .90 .95
67 179 281 383 435
lowest : 0 3 9 11 13, highest: 613 621 625 627 645
--------------------------------------------------------------------------------
DMA
n missing distinct Info Mean Gmd .05 .10
9989 11 186 0.999 664.7 132.3 513 524
.25 .50 .75 .90 .95
561 635 803 819 839
lowest : 0 500 501 503 504, highest: 855 862 866 868 881
--------------------------------------------------------------------------------
IC1
n missing distinct Info Mean Gmd .05 .10
10000 0 806 1 337.9 167 147.0 179.0
.25 .50 .75 .90 .95
231.0 309.0 412.0 531.1 624.0
lowest : 0 50 54 56 59, highest: 1356 1368 1386 1395 1500
--------------------------------------------------------------------------------
IC2
n missing distinct Info Mean Gmd .05 .10
10000 0 865 1 384.7 176.9 185.0 219.0
.25 .50 .75 .90 .95
273.0 354.0 463.2 586.0 679.0
lowest : 0 50 54 56 59, highest: 1434 1442 1493 1496 1500
--------------------------------------------------------------------------------
IC3
n missing distinct Info Mean Gmd .05 .10
10000 0 830 1 384.6 169.6 197 225
.25 .50 .75 .90 .95
277 353 463 590 690
lowest : 0 25 41 62 68, highest: 1153 1157 1199 1208 1226
--------------------------------------------------------------------------------
IC4
n missing distinct Info Mean Gmd .05 .10
10000 0 877 1 428.1 181.1 225 259
.25 .50 .75 .90 .95
314 395 511 652 749
lowest : 0 71 75 80 85, highest: 1239 1246 1252 1272 1273
--------------------------------------------------------------------------------
IC5
n missing distinct Info Mean Gmd .05 .10
10000 0 7301 1 15546 7733 7462 8789
.25 .50 .75 .90 .95
10770 13626 17962 24128 29466
lowest : 0 1500 1972 2447 2582, highest: 87766 91191 94863 98040 105251
--------------------------------------------------------------------------------
IC6
n missing distinct Info Mean Gmd .05 .10
10000 0 89 0.999 21.63 16.11 3 5
.25 .50 .75 .90 .95
10 19 31 41 49
lowest : 0 1 2 3 4, highest: 84 85 96 97 99
--------------------------------------------------------------------------------
IC7
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.999 16.99 8.957 3 6
.25 .50 .75 .90 .95
11 17 22 27 30
lowest : 0 1 2 3 4, highest: 49 50 53 55 99
--------------------------------------------------------------------------------
IC8
n missing distinct Info Mean Gmd .05 .10
10000 0 46 0.997 15.78 6.99 5 8
.25 .50 .75 .90 .95
12 16 19 23 26
lowest : 0 1 2 3 4, highest: 42 44 53 64 99
--------------------------------------------------------------------------------
IC9
n missing distinct Info Mean Gmd .05 .10
10000 0 55 0.998 18.32 8.309 7 9
.25 .50 .75 .90 .95
13 18 23 28 31
lowest : 0 1 2 3 4, highest: 50 53 57 60 99
--------------------------------------------------------------------------------
IC10
n missing distinct Info Mean Gmd .05 .10
10000 0 56 0.999 15.92 10.89 3 5
.25 .50 .75 .90 .95
8 14 22 30 34
lowest : 0 1 2 3 4, highest: 51 52 55 60 61
--------------------------------------------------------------------------------
IC11
n missing distinct Info Mean Gmd .05 .10
10000 0 39 0.99 5.474 5.857 0 0
.25 .50 .75 .90 .95
1 4 8 14 17
lowest : 0 1 2 3 4, highest: 34 35 36 37 39
--------------------------------------------------------------------------------
IC12
n missing distinct Info Mean Gmd .05 .10
10000 0 28 0.922 2.164 2.958 0 0
.25 .50 .75 .90 .95
0 1 3 6 9
lowest : 0 1 2 3 4, highest: 23 24 26 27 31
--------------------------------------------------------------------------------
IC13
n missing distinct Info Mean Gmd .05 .10
10000 0 19 0.725 0.9079 1.438 0 0
.25 .50 .75 .90 .95
0 0 1 3 4
lowest : 0 1 2 3 4, highest: 14 15 16 17 21
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 6471 1515 846 427 242 152 111 69 39 45 30
Proportion 0.647 0.152 0.085 0.043 0.024 0.015 0.011 0.007 0.004 0.004 0.003
Value 11 12 13 14 15 16 17 21
Frequency 10 16 9 7 3 5 2 1
Proportion 0.001 0.002 0.001 0.001 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
IC14
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.826 1.83 2.924 0 0
.25 .50 .75 .90 .95
0 0 2 5 8
lowest : 0 1 2 3 4, highest: 52 55 56 57 59
--------------------------------------------------------------------------------
IC15
n missing distinct Info Mean Gmd .05 .10
10000 0 80 0.999 14.88 13.15 0 2
.25 .50 .75 .90 .95
5 12 21 31 39
lowest : 0 1 2 3 4, highest: 80 84 85 95 99
--------------------------------------------------------------------------------
IC16
n missing distinct Info Mean Gmd .05 .10
10000 0 66 0.999 16 10.4 2 4
.25 .50 .75 .90 .95
9 16 22 28 31
lowest : 0 1 2 3 4, highest: 64 67 71 82 99
--------------------------------------------------------------------------------
IC17
n missing distinct Info Mean Gmd .05 .10
10000 0 58 0.998 16.51 8.47 4 7
.25 .50 .75 .90 .95
12 17 21 26 29
lowest : 0 1 2 3 4, highest: 62 64 68 73 99
--------------------------------------------------------------------------------
IC18
n missing distinct Info Mean Gmd .05 .10
10000 0 62 0.999 20.6 9.393 7 10
.25 .50 .75 .90 .95
15 21 26 31 34
lowest : 0 1 2 3 4, highest: 59 60 61 66 99
--------------------------------------------------------------------------------
IC19
n missing distinct Info Mean Gmd .05 .10
10000 0 61 0.999 18.68 12.11 3 6
.25 .50 .75 .90 .95
10 17 26 33 38
lowest : 0 1 2 3 4, highest: 56 57 59 61 70
--------------------------------------------------------------------------------
IC20
n missing distinct Info Mean Gmd .05 .10
10000 0 42 0.991 6.48 6.883 0 0
.25 .50 .75 .90 .95
2 4 10 16 20
lowest : 0 1 2 3 4, highest: 37 38 39 42 55
--------------------------------------------------------------------------------
IC21
n missing distinct Info Mean Gmd .05 .10
10000 0 31 0.918 2.533 3.5 0 0
.25 .50 .75 .90 .95
0 1 3 8 11
lowest : 0 1 2 3 4, highest: 29 31 36 38 40
--------------------------------------------------------------------------------
IC22
n missing distinct Info Mean Gmd .05 .10
10000 0 21 0.729 1.091 1.737 0 0
.25 .50 .75 .90 .95
0 0 1 4 6
lowest : 0 1 2 3 4, highest: 16 17 18 19 33
--------------------------------------------------------------------------------
IC23
n missing distinct Info Mean Gmd .05 .10
10000 0 60 0.823 2.218 3.579 0 0
.25 .50 .75 .90 .95
0 0 2 6 11
lowest : 0 1 2 3 4, highest: 60 61 62 67 70
--------------------------------------------------------------------------------
HHAS1
n missing distinct Info Mean Gmd .05 .10
10000 0 95 0.999 26.58 14.85 7 11
.25 .50 .75 .90 .95
17 26 34 43 49
lowest : 0 1 2 3 4, highest: 91 92 93 95 99
--------------------------------------------------------------------------------
HHAS2
n missing distinct Info Mean Gmd .05 .10
10000 0 57 0.993 6.165 6.018 0 0
.25 .50 .75 .90 .95
2 5 8 13 18
lowest : 0 1 2 3 4, highest: 55 57 59 60 76
--------------------------------------------------------------------------------
HHAS3
n missing distinct Info Mean Gmd .05 .10
10000 0 96 1 42.65 19.28 15 21
.25 .50 .75 .90 .95
31 43 54 65 71
lowest : 0 1 2 3 4, highest: 91 92 94 96 99
--------------------------------------------------------------------------------
HHAS4
n missing distinct Info Mean Gmd .05 .10
10000 0 74 0.998 10.86 10.14 1 2
.25 .50 .75 .90 .95
4 8 15 23 30
lowest : 0 1 2 3 4, highest: 75 79 89 91 99
--------------------------------------------------------------------------------
MC1
n missing distinct Info Mean Gmd .05 .10
10000 0 93 1 48.26 17.76 25 30
.25 .50 .75 .90 .95
37 47 58 70 77
lowest : 0 5 8 9 10, highest: 94 95 96 98 99
--------------------------------------------------------------------------------
MC2
n missing distinct Info Mean Gmd .05 .10
10000 0 94 1 50.82 17.81 21 29
.25 .50 .75 .90 .95
41 53 62 69 74
lowest : 0 1 2 4 5, highest: 90 91 92 95 99
--------------------------------------------------------------------------------
MC3
n missing distinct Info Mean Gmd .05 .10
10000 0 78 0.998 12.16 10.32 1 3
.25 .50 .75 .90 .95
5 9 16 25 31
lowest : 0 1 2 3 4, highest: 84 86 88 91 99
--------------------------------------------------------------------------------
TPE1
n missing distinct Info Mean Gmd .05 .10
10000 0 84 0.999 76.05 12.83 54 63
.25 .50 .75 .90 .95
72 79 84 88 91
lowest : 0 6 11 16 17, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
TPE2
n missing distinct Info Mean Gmd .05 .10
10000 0 50 0.997 13 7.268 3 5
.25 .50 .75 .90 .95
9 12 17 21 25
lowest : 0 1 2 3 4, highest: 45 46 47 49 56
--------------------------------------------------------------------------------
TPE3
n missing distinct Info Mean Gmd .05 .10
10000 0 57 0.848 2.34 3.726 0 0
.25 .50 .75 .90 .95
0 0 2 6 12
lowest : 0 1 2 3 4, highest: 53 55 58 60 62
--------------------------------------------------------------------------------
TPE4
n missing distinct Info Mean Gmd .05 .10
10000 0 52 0.797 1.81 2.953 0 0
.25 .50 .75 .90 .95
0 0 2 5 9
lowest : 0 1 2 3 4, highest: 50 51 55 58 62
--------------------------------------------------------------------------------
TPE5
n missing distinct Info Mean Gmd .05 .10
10000 0 31 0.224 0.4137 0.7936 0 0
.25 .50 .75 .90 .95
0 0 0 0 2
lowest : 0 1 2 3 4, highest: 29 32 36 44 50
--------------------------------------------------------------------------------
TPE6
n missing distinct Info Mean Gmd .05 .10
10000 0 18 0.156 0.1101 0.2132 0 0
.25 .50 .75 .90 .95
0 0 0 0 1
lowest : 0 1 2 3 4, highest: 21 26 27 29 30
Value 0 1 2 3 4 5 6 7 8 9 11
Frequency 9450 359 100 44 18 7 2 3 4 2 3
Proportion 0.945 0.036 0.010 0.004 0.002 0.001 0.000 0.000 0.000 0.000 0.000
Value 13 14 21 26 27 29 30
Frequency 1 1 1 1 2 1 1
Proportion 0.000 0.000 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
TPE7
n missing distinct Info Mean Gmd .05 .10
10000 0 14 0.411 0.242 0.4258 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 10 11 12 16 22
Value 0 1 2 3 4 5 6 7 8 10 11
Frequency 8373 1140 333 92 33 15 4 4 1 1 1
Proportion 0.837 0.114 0.033 0.009 0.003 0.002 0.000 0.000 0.000 0.000 0.000
Value 12 16 22
Frequency 1 1 1
Proportion 0.000 0.000 0.000
--------------------------------------------------------------------------------
TPE8
n missing distinct Info Mean Gmd .05 .10
10000 0 60 0.984 4.128 4.588 0 0
.25 .50 .75 .90 .95
1 3 5 9 13
lowest : 0 1 2 3 4, highest: 62 65 68 69 99
--------------------------------------------------------------------------------
TPE9
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.98 3.268 3.34 0 0
.25 .50 .75 .90 .95
1 2 4 7 10
lowest : 0 1 2 3 4, highest: 34 35 36 40 55
--------------------------------------------------------------------------------
PEC1
n missing distinct Info Mean Gmd .05 .10
10000 0 69 0.883 2.311 3.675 0 0
.25 .50 .75 .90 .95
0 1 2 4 10
lowest : 0 1 2 3 4, highest: 67 68 72 75 99
--------------------------------------------------------------------------------
PEC2
n missing distinct Info Mean Gmd .05 .10
10000 0 97 0.999 18.36 19.97 0 1
.25 .50 .75 .90 .95
4 11 27 48 60
lowest : 0 1 2 3 4, highest: 92 94 95 96 99
--------------------------------------------------------------------------------
TPE10
n missing distinct Info Mean Gmd .05 .10
10000 0 52 0.997 19.39 7.059 10 12
.25 .50 .75 .90 .95
15 19 23 28 31
lowest : 0 3 4 5 6, highest: 51 52 53 59 90
--------------------------------------------------------------------------------
TPE11
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.998 23.63 7.265 14 16
.25 .50 .75 .90 .95
19 23 28 32 35
lowest : 0 3 5 7 8, highest: 52 53 54 57 64
--------------------------------------------------------------------------------
TPE12
n missing distinct Info Mean Gmd .05 .10
10000 0 45 0.991 5.242 5.177 0 0
.25 .50 .75 .90 .95
2 4 7 12 15
lowest : 0 1 2 3 4, highest: 41 43 44 47 50
--------------------------------------------------------------------------------
TPE13
n missing distinct Info Mean Gmd .05 .10
10000 0 97 1 59.89 19.25 26 35
.25 .50 .75 .90 .95
50 63 73 79 82
lowest : 0 3 4 5 6, highest: 94 95 96 97 99
--------------------------------------------------------------------------------
LFC1
n missing distinct Info Mean Gmd .05 .10
10000 0 96 0.999 64.47 14.01 42 50
.25 .50 .75 .90 .95
59 66 73 79 82
lowest : 0 2 5 6 7, highest: 94 96 97 98 99
--------------------------------------------------------------------------------
LFC2
n missing distinct Info Mean Gmd .05 .10
10000 0 97 0.999 73.82 14.65 49 59
.25 .50 .75 .90 .95
68 76 83 88 91
lowest : 0 2 4 6 7, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LFC3
n missing distinct Info Mean Gmd .05 .10
10000 0 96 0.999 55.94 14.74 33 40
.25 .50 .75 .90 .95
49 57 65 71 76
lowest : 0 2 4 5 6, highest: 93 94 95 96 99
--------------------------------------------------------------------------------
LFC4
n missing distinct Info Mean Gmd .05 .10
10000 0 98 0.999 69.79 15.43 43.95 53.00
.25 .50 .75 .90 .95
63.00 72.00 79.00 85.00 88.00
lowest : 0 2 4 5 6, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LFC5
n missing distinct Info Mean Gmd .05 .10
10000 0 95 0.999 52.83 14.91 30 37
.25 .50 .75 .90 .95
45 54 62 69 73
lowest : 0 2 4 5 6, highest: 92 93 94 96 99
--------------------------------------------------------------------------------
LFC6
n missing distinct Info Mean Gmd .05 .10
10000 0 94 0.999 63.99 17.21 36.0 46.0
.25 .50 .75 .90 .95
57.0 66.0 74.0 81.1 86.0
lowest : 0 4 7 8 10, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LFC7
n missing distinct Info Mean Gmd .05 .10
10000 0 98 1 48.79 19.02 17 27
.25 .50 .75 .90 .95
40 50 60 69 75
lowest : 0 1 2 3 4, highest: 93 94 95 96 99
--------------------------------------------------------------------------------
LFC8
n missing distinct Info Mean Gmd .05 .10
10000 0 95 0.974 69.93 33 0 8
.25 .50 .75 .90 .95
55 78 99 99 99
lowest : 0 3 7 8 9, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LFC9
n missing distinct Info Mean Gmd .05 .10
10000 0 91 0.848 62.91 45.3 0 0
.25 .50 .75 .90 .95
0 97 99 99 99
lowest : 0 6 7 11 13, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LFC10
n missing distinct Info Mean Gmd .05 .10
10000 0 78 0.972 6.568 8.268 0 0
.25 .50 .75 .90 .95
0 4 9 17 24
lowest : 0 1 2 3 4, highest: 80 82 84 95 99
--------------------------------------------------------------------------------
OCC1
n missing distinct Info Mean Gmd .05 .10
10000 0 59 0.998 14.07 8.897 3 5
.25 .50 .75 .90 .95
8 13 18 25 30
lowest : 0 1 2 3 4, highest: 55 58 59 61 99
--------------------------------------------------------------------------------
OCC2
n missing distinct Info Mean Gmd .05 .10
10000 0 47 0.998 12.48 7.738 3 5
.25 .50 .75 .90 .95
8 11 17 22 26
lowest : 0 1 2 3 4, highest: 42 43 44 46 68
--------------------------------------------------------------------------------
OCC3
n missing distinct Info Mean Gmd .05 .10
10000 0 25 0.981 3.567 2.693 0 0
.25 .50 .75 .90 .95
2 3 5 7 8
lowest : 0 1 2 3 4, highest: 21 22 30 33 99
--------------------------------------------------------------------------------
OCC4
n missing distinct Info Mean Gmd .05 .10
10000 0 49 0.996 12.31 6.056 4 6
.25 .50 .75 .90 .95
9 12 15 19 22
lowest : 0 1 2 3 4, highest: 44 49 50 51 55
--------------------------------------------------------------------------------
OCC5
n missing distinct Info Mean Gmd .05 .10
10000 0 48 0.997 15.48 6.239 7 9
.25 .50 .75 .90 .95
12 15 19 22 25
lowest : 0 1 2 3 4, highest: 43 50 52 63 68
--------------------------------------------------------------------------------
OCC6
n missing distinct Info Mean Gmd .05 .10
10000 0 18 0.592 0.4224 0.6911 0 0
.25 .50 .75 .90 .95
0 0 1 1 2
lowest : 0 1 2 3 4, highest: 13 14 15 18 20
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 7381 1801 469 190 71 27 29 10 5 4 2
Proportion 0.738 0.180 0.047 0.019 0.007 0.003 0.003 0.001 0.000 0.000 0.000
Value 11 12 13 14 15 18 20
Frequency 3 1 2 1 1 2 1
Proportion 0.000 0.000 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
OCC7
n missing distinct Info Mean Gmd .05 .10
10000 0 20 0.945 1.657 1.776 0 0
.25 .50 .75 .90 .95
0 1 2 4 5
lowest : 0 1 2 3 4, highest: 15 19 21 24 27
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 2911 2685 2115 1095 568 294 148 61 33 27 15
Proportion 0.291 0.268 0.212 0.110 0.057 0.029 0.015 0.006 0.003 0.003 0.002
Value 11 12 13 14 15 19 21 24 27
Frequency 14 6 7 12 5 1 1 1 1
Proportion 0.001 0.001 0.001 0.001 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
OCC8
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.996 10.64 6.399 3 4
.25 .50 .75 .90 .95
7 10 13 18 22
lowest : 0 1 2 3 4, highest: 49 50 52 53 74
--------------------------------------------------------------------------------
OCC9
n missing distinct Info Mean Gmd .05 .10
10000 0 49 0.944 2.832 3.925 0 0
.25 .50 .75 .90 .95
0 1 3 7 12
lowest : 0 1 2 3 4, highest: 47 49 50 52 99
--------------------------------------------------------------------------------
OCC10
n missing distinct Info Mean Gmd .05 .10
10000 0 46 0.997 11.52 6.623 2 4
.25 .50 .75 .90 .95
7 11 15 19 21
lowest : 0 1 2 3 4, highest: 46 49 63 65 99
--------------------------------------------------------------------------------
OCC11
n missing distinct Info Mean Gmd .05 .10
10000 0 44 0.994 6.3 6.059 0 0
.25 .50 .75 .90 .95
2 5 9 14 18
lowest : 0 1 2 3 4, highest: 42 48 50 54 61
--------------------------------------------------------------------------------
OCC12
n missing distinct Info Mean Gmd .05 .10
10000 0 29 0.988 4.077 3.462 0 0
.25 .50 .75 .90 .95
2 4 6 8 10
lowest : 0 1 2 3 4, highest: 27 31 48 50 55
--------------------------------------------------------------------------------
OCC13
n missing distinct Info Mean Gmd .05 .10
10000 0 29 0.985 3.679 3.108 0 0
.25 .50 .75 .90 .95
2 3 5 7 9
lowest : 0 1 2 3 4, highest: 24 27 30 36 38
--------------------------------------------------------------------------------
EIC1
n missing distinct Info Mean Gmd .05 .10
10000 0 52 0.955 3.1 4.219 0 0
.25 .50 .75 .90 .95
0 1 4 8 13
lowest : 0 1 2 3 4, highest: 50 51 54 55 99
--------------------------------------------------------------------------------
EIC2
n missing distinct Info Mean Gmd .05 .10
10000 0 35 0.537 0.7198 1.278 0 0
.25 .50 .75 .90 .95
0 0 0 2 4
lowest : 0 1 2 3 4, highest: 34 37 38 39 56
--------------------------------------------------------------------------------
EIC3
n missing distinct Info Mean Gmd .05 .10
10000 0 34 0.993 6.449 4.388 0 2
.25 .50 .75 .90 .95
4 6 9 11 14
lowest : 0 1 2 3 4, highest: 29 30 36 38 50
--------------------------------------------------------------------------------
EIC4
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.999 17 11.29 3 5
.25 .50 .75 .90 .95
9 15 23 31 36
lowest : 0 1 2 3 4, highest: 64 65 66 72 99
--------------------------------------------------------------------------------
EIC5
n missing distinct Info Mean Gmd .05 .10
10000 0 32 0.986 4.351 3.357 0 1
.25 .50 .75 .90 .95
2 4 6 8 10
lowest : 0 1 2 3 4, highest: 29 30 32 35 99
--------------------------------------------------------------------------------
EIC6
n missing distinct Info Mean Gmd .05 .10
10000 0 23 0.974 2.707 2.394 0 0
.25 .50 .75 .90 .95
1 2 4 6 7
lowest : 0 1 2 3 4, highest: 18 19 21 24 38
--------------------------------------------------------------------------------
EIC7
n missing distinct Info Mean Gmd .05 .10
10000 0 27 0.987 4.343 3.247 0 1
.25 .50 .75 .90 .95
2 4 6 8 10
lowest : 0 1 2 3 4, highest: 23 27 32 36 48
--------------------------------------------------------------------------------
EIC8
n missing distinct Info Mean Gmd .05 .10
10000 0 57 0.997 16.92 6.69 8 10
.25 .50 .75 .90 .95
13 16 20 24 27
lowest : 0 1 2 3 4, highest: 65 68 72 85 99
--------------------------------------------------------------------------------
EIC9
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.994 6.806 4.946 0 2
.25 .50 .75 .90 .95
4 6 9 13 15
lowest : 0 1 2 3 4, highest: 33 34 35 41 59
--------------------------------------------------------------------------------
EIC10
n missing distinct Info Mean Gmd .05 .10
10000 0 26 0.986 4.639 3.143 0 1
.25 .50 .75 .90 .95
3 4 6 8 10
lowest : 0 1 2 3 4, highest: 21 23 24 28 78
--------------------------------------------------------------------------------
EIC11
n missing distinct Info Mean Gmd .05 .10
10000 0 39 0.976 3.313 2.971 0 0
.25 .50 .75 .90 .95
2 3 4 6 9
lowest : 0 1 2 3 4, highest: 41 45 51 60 99
--------------------------------------------------------------------------------
EIC12
n missing distinct Info Mean Gmd .05 .10
10000 0 32 0.927 1.557 1.901 0 0
.25 .50 .75 .90 .95
0 1 2 4 5
lowest : 0 1 2 3 4, highest: 30 32 33 50 53
--------------------------------------------------------------------------------
EIC13
n missing distinct Info Mean Gmd .05 .10
10000 0 41 0.993 8.057 4.698 2.0 3.0
.25 .50 .75 .90 .95
5.0 7.5 10.0 13.0 16.0
lowest : 0 1 2 3 4, highest: 36 41 42 46 60
--------------------------------------------------------------------------------
EIC14
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.994 8.049 5.41 1 3
.25 .50 .75 .90 .95
5 7 10 14 17
lowest : 0 1 2 3 4, highest: 55 56 61 66 69
--------------------------------------------------------------------------------
EIC15
n missing distinct Info Mean Gmd .05 .10
10000 0 42 0.992 6.421 4.495 1 2
.25 .50 .75 .90 .95
4 6 8 12 15
lowest : 0 1 2 3 4, highest: 45 48 50 52 99
--------------------------------------------------------------------------------
EIC16
n missing distinct Info Mean Gmd .05 .10
10000 0 39 0.988 4.614 3.995 0 1
.25 .50 .75 .90 .95
2 4 6 9 12
lowest : 0 1 2 3 4, highest: 35 36 37 39 54
--------------------------------------------------------------------------------
OEDC1
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.993 7.039 4.506 1 2
.25 .50 .75 .90 .95
4 6 9 12 15
lowest : 0 1 2 3 4, highest: 37 38 43 44 57
--------------------------------------------------------------------------------
OEDC2
n missing distinct Info Mean Gmd .05 .10
10000 0 49 0.988 4.693 4.766 0 0
.25 .50 .75 .90 .95
2 3 6 10 14
lowest : 0 1 2 3 4, highest: 45 46 48 52 61
--------------------------------------------------------------------------------
OEDC3
n missing distinct Info Mean Gmd .05 .10
10000 0 45 0.976 3.232 3.599 0 0
.25 .50 .75 .90 .95
1 2 4 7 10
lowest : 0 1 2 3 4, highest: 41 44 46 49 50
--------------------------------------------------------------------------------
OEDC4
n missing distinct Info Mean Gmd .05 .10
10000 0 46 0.995 8.013 5.691 1 2
.25 .50 .75 .90 .95
4 7 10 15 18
lowest : 0 1 2 3 4, highest: 41 42 43 44 99
--------------------------------------------------------------------------------
OEDC5
n missing distinct Info Mean Gmd .05 .10
10000 0 77 0.999 69.45 12.64 50 56
.25 .50 .75 .90 .95
64 71 77 82 85
lowest : 0 19 21 24 26, highest: 94 95 96 97 99
--------------------------------------------------------------------------------
OEDC6
n missing distinct Info Mean Gmd .05 .10
10000 0 43 0.991 6.085 4.313 0 2
.25 .50 .75 .90 .95
3 5 8 11 13
lowest : 0 1 2 3 4, highest: 46 55 58 74 99
--------------------------------------------------------------------------------
OEDC7
n missing distinct Info Mean Gmd .05 .10
10000 0 15 0.672 0.5065 0.7859 0 0
.25 .50 .75 .90 .95
0 0 1 2 2
lowest : 0 1 2 3 4, highest: 10 11 12 13 49
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 6826 2130 651 226 75 35 19 10 14 4 4
Proportion 0.683 0.213 0.065 0.023 0.007 0.004 0.002 0.001 0.001 0.000 0.000
Value 11 12 13 49
Frequency 2 2 1 1
Proportion 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
EC1
n missing distinct Info Mean Gmd .05 .10
10000 0 79 0.874 127.9 15.78 120 120
.25 .50 .75 .90 .95
120 120 140 151 160
lowest : 0 89 90 91 92, highest: 162 163 164 166 170
--------------------------------------------------------------------------------
EC2
n missing distinct Info Mean Gmd .05 .10
10000 0 63 0.997 8.786 8.451 0 1
.25 .50 .75 .90 .95
3 6 12 20 25
lowest : 0 1 2 3 4, highest: 61 62 67 68 78
--------------------------------------------------------------------------------
EC3
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.998 12.95 8.617 2 4
.25 .50 .75 .90 .95
7 12 18 23 26
lowest : 0 1 2 3 4, highest: 55 58 62 82 99
--------------------------------------------------------------------------------
EC4
n missing distinct Info Mean Gmd .05 .10
10000 0 65 0.999 28.74 11.56 11 15
.25 .50 .75 .90 .95
22 29 36 41 44
lowest : 0 1 2 3 4, highest: 60 61 63 71 99
--------------------------------------------------------------------------------
EC5
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.998 20.77 7.97 10 12
.25 .50 .75 .90 .95
16 21 25 30 32
lowest : 0 1 2 3 4, highest: 50 51 52 60 61
--------------------------------------------------------------------------------
EC6
n missing distinct Info Mean Gmd .05 .10
10000 0 25 0.991 6.522 3.706 2 2
.25 .50 .75 .90 .95
4 6 9 11 12
lowest : 0 1 2 3 4, highest: 20 21 22 23 30
--------------------------------------------------------------------------------
EC7
n missing distinct Info Mean Gmd .05 .10
10000 0 55 0.999 13.97 10.72 2 4
.25 .50 .75 .90 .95
6 12 20 28 33
lowest : 0 1 2 3 4, highest: 50 53 55 58 99
--------------------------------------------------------------------------------
EC8
n missing distinct Info Mean Gmd .05 .10
10000 0 56 0.995 7.33 7.086 0 1
.25 .50 .75 .90 .95
3 5 10 17 22
lowest : 0 1 2 3 4, highest: 58 59 67 75 99
--------------------------------------------------------------------------------
SEC1
n missing distinct Info Mean Gmd .05 .10
10000 0 44 0.983 3.73 3.397 0 1
.25 .50 .75 .90 .95
1 3 5 7 10
lowest : 0 1 2 3 4, highest: 59 63 68 73 74
--------------------------------------------------------------------------------
SEC2
n missing distinct Info Mean Gmd .05 .10
10000 0 76 0.998 22.22 8.287 9 13
.25 .50 .75 .90 .95
18 23 26 31 33
lowest : 0 1 2 3 4, highest: 84 85 90 95 99
--------------------------------------------------------------------------------
SEC3
n missing distinct Info Mean Gmd .05 .10
10000 0 15 0.94 1.894 1.453 0 0
.25 .50 .75 .90 .95
1 2 3 4 4
lowest : 0 1 2 3 4, highest: 10 11 12 13 19
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 1349 2886 3071 1576 684 286 84 38 11 7 2
Proportion 0.135 0.289 0.307 0.158 0.068 0.029 0.008 0.004 0.001 0.001 0.000
Value 11 12 13 19
Frequency 1 3 1 1
Proportion 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
SEC4
n missing distinct Info Mean Gmd .05 .10
10000 0 45 0.997 17.23 7.026 5 9
.25 .50 .75 .90 .95
14 18 21 25 27
lowest : 0 1 2 3 4, highest: 40 42 44 48 51
--------------------------------------------------------------------------------
SEC5
n missing distinct Info Mean Gmd .05 .10
10000 0 74 0.991 6.835 4.732 2 2
.25 .50 .75 .90 .95
4 6 8 11 14
lowest : 0 1 2 3 4, highest: 81 86 92 97 99
--------------------------------------------------------------------------------
AFC1
n missing distinct Info Mean Gmd .05 .10
10000 0 39 0.447 0.5791 1.07 0 0
.25 .50 .75 .90 .95
0 0 0 1 2
lowest : 0 1 2 3 4, highest: 71 74 78 84 91
--------------------------------------------------------------------------------
AFC2
n missing distinct Info Mean Gmd .05 .10
10000 0 55 0.542 1.025 1.867 0 0
.25 .50 .75 .90 .95
0 0 0 2 4
lowest : 0 1 2 3 4, highest: 90 92 93 94 97
--------------------------------------------------------------------------------
AFC3
n missing distinct Info Mean Gmd .05 .10
10000 0 22 0.16 0.1475 0.2866 0 0
.25 .50 .75 .90 .95
0 0 0 0 1
lowest : 0 1 2 3 4, highest: 21 36 48 54 78
--------------------------------------------------------------------------------
AFC4
n missing distinct Info Mean Gmd .05 .10
10000 0 46 0.996 15.58 5.821 7 9
.25 .50 .75 .90 .95
12 15 19 22 24
lowest : 0 1 2 3 4, highest: 41 44 46 50 53
--------------------------------------------------------------------------------
AFC5
n missing distinct Info Mean Gmd .05 .10
10000 0 83 0.999 31.33 11.61 15 19
.25 .50 .75 .90 .95
25 31 37 44 49
lowest : 0 1 2 3 4, highest: 82 83 84 85 99
--------------------------------------------------------------------------------
AFC6
n missing distinct Info Mean Gmd .05 .10
10000 0 21 0.908 1.262 1.545 0 0
.25 .50 .75 .90 .95
0 1 2 3 4
lowest : 0 1 2 3 4, highest: 16 17 20 23 27
--------------------------------------------------------------------------------
VC1
n missing distinct Info Mean Gmd .05 .10
10000 0 91 0.999 30.62 16.2 7 13
.25 .50 .75 .90 .95
21 30 39 49 56
lowest : 0 1 2 3 4, highest: 88 90 91 93 99
--------------------------------------------------------------------------------
VC2
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.998 17.92 10.6 0 6
.25 .50 .75 .90 .95
12 17 23 30 35
lowest : 0 1 2 3 4, highest: 68 74 83 88 99
--------------------------------------------------------------------------------
VC3
n missing distinct Info Mean Gmd .05 .10
10000 0 98 1 33.21 18.87 7 12
.25 .50 .75 .90 .95
22 32 43 55 63
lowest : 0 1 2 3 4, highest: 94 95 96 97 99
--------------------------------------------------------------------------------
VC4
n missing distinct Info Mean Gmd .05 .10
10000 0 80 0.994 11.32 10.84 0 0
.25 .50 .75 .90 .95
4 9 16 24 31
lowest : 0 1 2 3 4, highest: 85 86 91 92 99
--------------------------------------------------------------------------------
ANC1
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.772 0.7341 1.091 0 0
.25 .50 .75 .90 .95
0 0 1 2 2
lowest : 0 1 2 3 4, highest: 47 48 50 54 61
--------------------------------------------------------------------------------
ANC2
n missing distinct Info Mean Gmd .05 .10
10000 0 37 0.993 5.457 4.426 0 1
.25 .50 .75 .90 .95
2 5 8 11 13
lowest : 0 1 2 3 4, highest: 33 41 43 49 50
--------------------------------------------------------------------------------
ANC3
n missing distinct Info Mean Gmd .05 .10
10000 0 21 0.847 0.8512 1.063 0 0
.25 .50 .75 .90 .95
0 1 1 2 3
lowest : 0 1 2 3 4, highest: 17 18 19 21 26
--------------------------------------------------------------------------------
ANC4
n missing distinct Info Mean Gmd .05 .10
10000 0 69 0.997 10.16 8.11 1 2
.25 .50 .75 .90 .95
5 8 13 19 26
lowest : 0 1 2 3 4, highest: 66 68 70 72 75
--------------------------------------------------------------------------------
ANC5
n missing distinct Info Mean Gmd .05 .10
10000 0 13 0.324 0.1802 0.328 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 8 9 10 12 13
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 8772 928 174 64 31 17 1 3 3 1 4
Proportion 0.877 0.093 0.017 0.006 0.003 0.002 0.000 0.000 0.000 0.000 0.000
Value 12 13
Frequency 1 1
Proportion 0.000 0.000
--------------------------------------------------------------------------------
ANC6
n missing distinct Info Mean Gmd .05 .10
10000 0 10 0.341 0.1693 0.3032 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 8696 1051 178 40 23 5 3 2 1 1
Proportion 0.870 0.105 0.018 0.004 0.002 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
ANC7
n missing distinct Info Mean Gmd .05 .10
10000 0 33 0.989 4.838 3.548 0 1
.25 .50 .75 .90 .95
3 4 7 9 11
lowest : 0 1 2 3 4, highest: 33 37 45 48 99
--------------------------------------------------------------------------------
ANC8
n missing distinct Info Mean Gmd .05 .10
10000 0 31 0.936 1.79 2.216 0 0
.25 .50 .75 .90 .95
0 1 2 4 6
lowest : 0 1 2 3 4, highest: 27 31 34 40 45
--------------------------------------------------------------------------------
ANC9
n missing distinct Info Mean Gmd .05 .10
10000 0 36 0.707 0.7888 1.284 0 0
.25 .50 .75 .90 .95
0 0 1 2 3
lowest : 0 1 2 3 4, highest: 35 44 47 49 54
--------------------------------------------------------------------------------
ANC10
n missing distinct Info Mean Gmd .05 .10
10000 0 39 0.858 1.304 1.914 0 0
.25 .50 .75 .90 .95
0 1 1 3 5
lowest : 0 1 2 3 4, highest: 37 39 44 50 65
--------------------------------------------------------------------------------
ANC11
n missing distinct Info Mean Gmd .05 .10
10000 0 19 0.239 0.1627 0.3078 0 0
.25 .50 .75 .90 .95
0 0 0 0 1
lowest : 0 1 2 3 4, highest: 17 19 20 25 31
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 9129 574 147 63 27 25 16 3 5 1 1
Proportion 0.913 0.057 0.015 0.006 0.003 0.002 0.002 0.000 0.000 0.000 0.000
Value 11 12 15 17 19 20 25 31
Frequency 2 1 1 1 1 1 1 1
Proportion 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
ANC12
n missing distinct Info Mean Gmd .05 .10
10000 0 21 0.436 0.3829 0.6918 0 0
.25 .50 .75 .90 .95
0 0 0 1 2
lowest : 0 1 2 3 4, highest: 16 17 19 30 52
--------------------------------------------------------------------------------
ANC13
n missing distinct Info Mean Gmd .05 .10
10000 0 14 0.828 0.726 0.9066 0 0
.25 .50 .75 .90 .95
0 1 1 2 2
lowest : 0 1 2 3 4, highest: 9 10 11 17 50
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 4998 3588 950 280 110 42 17 6 3 1 1
Proportion 0.500 0.359 0.095 0.028 0.011 0.004 0.002 0.001 0.000 0.000 0.000
Value 11 17 50
Frequency 1 2 1
Proportion 0.000 0.000 0.000
--------------------------------------------------------------------------------
ANC14
n missing distinct Info Mean Gmd .05 .10
10000 0 20 0.79 0.7226 1.025 0 0
.25 .50 .75 .90 .95
0 0 1 2 3
lowest : 0 1 2 3 4, highest: 16 18 21 23 25
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 5725 2798 847 309 149 73 36 22 13 4 2
Proportion 0.573 0.280 0.085 0.031 0.015 0.007 0.004 0.002 0.001 0.000 0.000
Value 11 12 14 15 16 18 21 23 25
Frequency 6 6 2 1 3 1 1 1 1
Proportion 0.001 0.001 0.000 0.000 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
ANC15
n missing distinct Info Mean Gmd
10000 0 8 0.164 0.0777 0.1485
lowest : 0 1 2 3 4, highest: 3 4 5 6 32
Value 0 1 2 3 4 5 6 32
Frequency 9421 475 70 13 12 5 3 1
Proportion 0.942 0.048 0.007 0.001 0.001 0.000 0.000 0.000
--------------------------------------------------------------------------------
POBC1
n missing distinct Info Mean Gmd .05 .10
10000 0 81 0.988 6.645 8.429 0 0
.25 .50 .75 .90 .95
1 3 8 17 26
lowest : 0 1 2 3 4, highest: 78 79 80 83 86
--------------------------------------------------------------------------------
POBC2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 57.07 24.5 17 27
.25 .50 .75 .90 .95
43 59 75 84 87
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LSC1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.995 87.75 14.11 54 71
.25 .50 .75 .90 .95
86 94 97 98 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LSC2
n missing distinct Info Mean Gmd .05 .10
10000 0 99 0.975 6.077 8.759 0.00 0.00
.25 .50 .75 .90 .95
1.00 2.00 5.00 15.00 27.05
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LSC3
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.785 1.817 2.997 0 0
.25 .50 .75 .90 .95
0 0 2 5 9
lowest : 0 1 2 3 4, highest: 51 55 56 59 66
--------------------------------------------------------------------------------
LSC4
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.977 3.399 3.623 0 0
.25 .50 .75 .90 .95
1 2 4 7 10
lowest : 0 1 2 3 4, highest: 58 65 67 77 90
--------------------------------------------------------------------------------
VOC1
n missing distinct Info Mean Gmd .05 .10
10000 0 76 0.987 91.75 9.38 74 83
.25 .50 .75 .90 .95
90 95 98 99 99
lowest : 0 7 17 18 19, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
VOC2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 60.23 20.84 25 35
.25 .50 .75 .90 .95
49 62 74 83 87
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
VOC3
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.999 20.06 12.12 3.95 7.00
.25 .50 .75 .90 .95
12.00 19.00 27.00 34.00 39.00
lowest : 0 1 2 3 4, highest: 66 69 71 73 99
--------------------------------------------------------------------------------
HC1
n missing distinct Info Mean Gmd .05 .10
10000 0 32 0.99 6.83 4.276 2 3
.25 .50 .75 .90 .95
4 6 9 12 14
lowest : 0 1 2 3 4, highest: 27 28 29 30 31
--------------------------------------------------------------------------------
HC2
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.999 23.84 14.1 6 9
.25 .50 .75 .90 .95
15 21 32 43 51
lowest : 0 1 2 3 4, highest: 48 49 50 51 52
--------------------------------------------------------------------------------
HC3
n missing distinct Info Mean Gmd .05 .10
10000 0 62 0.853 2.549 3.934 0 0
.25 .50 .75 .90 .95
0 0 3 7 11
lowest : 0 1 2 3 4, highest: 76 81 82 98 99
--------------------------------------------------------------------------------
HC4
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.978 12.29 15.69 0 0
.25 .50 .75 .90 .95
0 6 17 32 47
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC5
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.993 22.66 25.13 0 0
.25 .50 .75 .90 .95
3 16 34 57 75
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC6
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 46.07 36.86 0 2
.25 .50 .75 .90 .95
16 47 73 93 98
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC7
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.998 62.71 35.67 3.0 10.9
.25 .50 .75 .90 .95
38.0 71.0 91.0 99.0 99.0
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC8
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 35.81 35.05 0 0
.25 .50 .75 .90 .95
8 28 59 87 95
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC9
n missing distinct Info Mean Gmd .05 .10
10000 0 69 0.603 2.563 4.505 0 0
.25 .50 .75 .90 .95
0 0 1 8 16
lowest : 0 1 2 3 4, highest: 73 74 75 79 89
--------------------------------------------------------------------------------
HC10
n missing distinct Info Mean Gmd .05 .10
10000 0 32 0.61 1.29 2.215 0 0
.25 .50 .75 .90 .95
0 0 1 4 8
lowest : 0 1 2 3 4, highest: 27 28 29 30 34
--------------------------------------------------------------------------------
HC11
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 52.29 40.43 0.0 0.0
.25 .50 .75 .90 .95
15.0 59.5 87.0 94.1 97.0
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC12
n missing distinct Info Mean Gmd .05 .10
10000 0 84 0.941 6.763 10.2 0 0
.25 .50 .75 .90 .95
0 1 7 24 35
lowest : 0 1 2 3 4, highest: 83 85 86 89 93
--------------------------------------------------------------------------------
HC13
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 29.46 30.01 0.00 2.00
.25 .50 .75 .90 .95
7.00 20.00 45.00 75.00 90.05
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC14
n missing distinct Info Mean Gmd .05 .10
10000 0 73 0.83 4.544 7.354 0 0
.25 .50 .75 .90 .95
0 0 4 16 26
lowest : 0 1 2 3 4, highest: 71 75 84 91 95
--------------------------------------------------------------------------------
HC15
n missing distinct Info Mean Gmd .05 .10
10000 0 13 0.116 0.0693 0.1349 0 0
.25 .50 .75 .90 .95
0 0 0 0 0
lowest : 0 1 2 3 4, highest: 8 9 10 12 30
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 9597 263 84 28 14 4 3 1 2 1 1
Proportion 0.960 0.026 0.008 0.003 0.001 0.000 0.000 0.000 0.000 0.000 0.000
Value 12 30
Frequency 1 1
Proportion 0.000 0.000
--------------------------------------------------------------------------------
HC16
n missing distinct Info Mean Gmd .05 .10
10000 0 83 0.91 5.367 8.324 0 0
.25 .50 .75 .90 .95
0 1 5 17 28
lowest : 0 1 2 3 4, highest: 79 83 85 89 99
--------------------------------------------------------------------------------
HC17
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.855 82.79 25.14 8 32
.25 .50 .75 .90 .95
80 99 99 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC18
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.88 14.97 23.23 0 0
.25 .50 .75 .90 .95
0 1 17 60 84
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC19
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.946 72.13 36.26 2 6
.25 .50 .75 .90 .95
45 95 99 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC20
n missing distinct Info Mean Gmd .05 .10
10000 0 32 0.451 97.54 2.78 96 97
.25 .50 .75 .90 .95
99 99 99 99 99
lowest : 0 27 38 50 60, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC21
n missing distinct Info Mean Gmd .05 .10
10000 0 50 0.926 94.37 6.863 83 88
.25 .50 .75 .90 .95
94 98 99 99 99
lowest : 0 9 32 33 44, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
MHUC1
n missing distinct Info Mean Gmd .05 .10
10000 0 22 0.987 8.036 3.676 4 5
.25 .50 .75 .90 .95
6 8 9 12 15
lowest : 0 1 2 3 4, highest: 17 18 19 20 21
--------------------------------------------------------------------------------
MHUC2
n missing distinct Info Mean Gmd
10000 0 6 0.784 2.315 0.8402
lowest : 0 1 2 3 4, highest: 1 2 3 4 5
Value 0 1 2 3 4 5
Frequency 388 392 5845 2522 760 93
Proportion 0.039 0.039 0.585 0.252 0.076 0.009
--------------------------------------------------------------------------------
AC1
n missing distinct Info Mean Gmd .05 .10
10000 0 28 0.985 5.811 3.09 1 2
.25 .50 .75 .90 .95
4 6 7 9 11
lowest : 0 1 2 3 4, highest: 24 26 27 28 38
--------------------------------------------------------------------------------
AC2
n missing distinct Info Mean Gmd .05 .10
10000 0 29 0.989 6.005 3.524 1 2
.25 .50 .75 .90 .95
4 6 8 10 12
lowest : 0 1 2 3 4, highest: 26 28 29 31 36
--------------------------------------------------------------------------------
ADATE_2
n missing distinct Info Mean Gmd
10000 0 2 0 9706 4e-04
Value 9704 9706
Frequency 1 9999
Proportion 0 1
--------------------------------------------------------------------------------
ADATE_3
n missing distinct Info Mean Gmd
9798 202 2 0.001 9606 0.001224
Value 9604 9606
Frequency 3 9795
Proportion 0 1
--------------------------------------------------------------------------------
ADATE_4
n missing distinct Info Mean Gmd
9777 223 7 0.029 9604 0.07266
lowest : 9511 9602 9603 9604 9605, highest: 9603 9604 9605 9607 9609
Value 9511 9602 9603 9604 9605 9607 9609
Frequency 1 2 3 9682 20 54 15
Proportion 0.000 0.000 0.000 0.990 0.002 0.006 0.002
--------------------------------------------------------------------------------
ADATE_5
n missing distinct Info Mean Gmd
6457 3543 1 0 9604 0
Value 9604
Frequency 6457
Proportion 1
--------------------------------------------------------------------------------
ADATE_6
n missing distinct Info Mean Gmd
9614 386 2 0.001 9603 0.001664
Value 9601 9603
Frequency 4 9610
Proportion 0 1
--------------------------------------------------------------------------------
ADATE_7
n missing distinct Info Mean Gmd
9088 912 3 0.165 9602 0.4426
Value 9512 9601 9602
Frequency 17 515 8556
Proportion 0.002 0.057 0.941
--------------------------------------------------------------------------------
ADATE_8
n missing distinct Info Mean Gmd
9627 373 4 0.19 9595 11.23
Value 9511 9512 9601 9603
Frequency 5 646 8973 3
Proportion 0.001 0.067 0.932 0.000
--------------------------------------------------------------------------------
ADATE_9
n missing distinct Info Mean Gmd
8818 1182 3 0.113 9511 0.1354
Value 9509 9510 9511
Frequency 272 75 8471
Proportion 0.031 0.009 0.961
--------------------------------------------------------------------------------
ADATE_10
n missing distinct Info Mean Gmd
6655 3345 2 0.215 9510 0.1433
Value 9510 9511
Frequency 6138 517
Proportion 0.922 0.078
--------------------------------------------------------------------------------
ADATE_11
n missing distinct Info Mean Gmd
8919 1081 4 0.651 9510 0.4792
Value 9508 9509 9510 9511
Frequency 6 2294 6173 446
Proportion 0.001 0.257 0.692 0.050
--------------------------------------------------------------------------------
ADATE_12
n missing distinct Info Mean Gmd
9071 929 4 0.742 9508 0.5096
Value 9507 9508 9509 9510
Frequency 5 5146 3839 81
Proportion 0.001 0.567 0.423 0.009
--------------------------------------------------------------------------------
ADATE_13
n missing distinct Info Mean Gmd
5744 4256 3 0.024 9507 0.05884
Value 9502 9506 9507
Frequency 31 15 5698
Proportion 0.005 0.003 0.992
--------------------------------------------------------------------------------
ADATE_14
n missing distinct Info Mean Gmd
8021 1979 2 0.005 9506 0.006473
Value 9504 9506
Frequency 13 8008
Proportion 0.002 0.998
--------------------------------------------------------------------------------
ADATE_15
n missing distinct Info Mean Gmd
3078 6922 1 0 9504 0
Value 9504
Frequency 3078
Proportion 1
--------------------------------------------------------------------------------
ADATE_16
n missing distinct Info Mean Gmd
7895 2105 3 0.062 9503 0.04179
Value 9502 9503 9504
Frequency 13 7727 155
Proportion 0.002 0.979 0.020
--------------------------------------------------------------------------------
ADATE_17
n missing distinct Info Mean Gmd
7083 2917 3 0.217 9502 0.1447
Value 9501 9502 9503
Frequency 555 6527 1
Proportion 0.078 0.922 0.000
--------------------------------------------------------------------------------
ADATE_18
n missing distinct Info Mean Gmd
7709 2291 6 0.724 9465 42.9
lowest : 9409 9411 9412 9501 9504, highest: 9411 9412 9501 9504 9505
Value 9409 9411 9412 9501 9504 9505
Frequency 3 8 3109 4586 1 2
Proportion 0.000 0.001 0.403 0.595 0.000 0.000
--------------------------------------------------------------------------------
ADATE_19
n missing distinct Info Mean Gmd
7462 2538 3 0.064 9411 0.08317
Value 9409 9410 9411
Frequency 154 9 7299
Proportion 0.021 0.001 0.978
--------------------------------------------------------------------------------
ADATE_20
n missing distinct Info Mean Gmd
4753 5247 2 0.001 9411 0.0004208
Value 9411 9412
Frequency 4752 1
Proportion 1 0
--------------------------------------------------------------------------------
ADATE_21
n missing distinct Info Mean Gmd
6342 3658 2 0.195 9410 0.13
Value 9409 9410
Frequency 443 5899
Proportion 0.07 0.93
--------------------------------------------------------------------------------
ADATE_22
n missing distinct Info Mean Gmd
7304 2696 4 0.224 9410 2.233
Value 9408 9409 9410 9506
Frequency 512 6710 2 80
Proportion 0.070 0.919 0.000 0.011
--------------------------------------------------------------------------------
ADATE_23
n missing distinct Info Mean Gmd
4123 5877 3 0.022 9407 0.06053
Value 9312 9406 9407
Frequency 1 30 4092
Proportion 0.000 0.007 0.992
--------------------------------------------------------------------------------
ADATE_24
n missing distinct Info Mean Gmd
6186 3814 2 0.015 9406 0.01029
Value 9405 9406
Frequency 32 6154
Proportion 0.005 0.995
--------------------------------------------------------------------------------
RFA_2
n missing distinct
10000 0 14
lowest : L1E L1F L1G L2E L2F, highest: L3G L4D L4E L4F L4G
Value L1E L1F L1G L2E L2F L2G L3D L3E L3F L3G L4D
Frequency 550 3237 1269 536 1174 476 234 823 363 149 504
Proportion 0.055 0.324 0.127 0.054 0.117 0.048 0.023 0.082 0.036 0.015 0.050
Value L4E L4F L4G
Frequency 388 206 91
Proportion 0.039 0.021 0.009
--------------------------------------------------------------------------------
RFA_3
n missing distinct
9798 202 60
lowest : A1D A1E A1F A1G A2D, highest: S3G S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_4
n missing distinct
9769 231 56
lowest : A1D A1E A1F A1G A2D, highest: S3G S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_5
n missing distinct
6457 3543 39
lowest : A1F A1G A2E A2F A2G, highest: S3G S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_6
n missing distinct
9614 386 92
lowest : A1A A1C A1D A1E A1F, highest: S4F S4G U1E U1F U1G
--------------------------------------------------------------------------------
RFA_7
n missing distinct
9088 912 89
lowest : A1C A1D A1E A1F A1G, highest: S4C S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_8
n missing distinct
9627 373 92
lowest : A1B A1C A1D A1E A1F, highest: S4F S4G U1E U1F U1G
--------------------------------------------------------------------------------
RFA_9
n missing distinct
8818 1182 99
lowest : A1A A1B A1C A1D A1E, highest: S4E S4F S4G U1F U1G
--------------------------------------------------------------------------------
RFA_10
n missing distinct
6655 3345 79
lowest : A1C A1D A1E A1F A1G, highest: S4C S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_11
n missing distinct
8919 1081 95
lowest : A1B A1C A1D A1E A1F, highest: S4C S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_12
n missing distinct
9071 929 99
lowest : A1B A1C A1D A1E A1F, highest: S4G U1D U1E U1F U1G
--------------------------------------------------------------------------------
RFA_13
n missing distinct
5744 4256 65
lowest : A1E A1F A1G A2D A2E, highest: S4C S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_14
n missing distinct
8021 1979 87
lowest : A1C A1D A1E A1F A1G, highest: S4D S4E S4F S4G U1D
--------------------------------------------------------------------------------
RFA_15
n missing distinct
3078 6922 31
lowest : A1G A2F A2G A3D A3E, highest: S3G S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_16
n missing distinct
7890 2110 106
lowest : A1A A1B A1C A1D A1E, highest: U1C U1D U1E U1F U1G
--------------------------------------------------------------------------------
RFA_17
n missing distinct
7083 2917 105
lowest : A1B A1C A1D A1E A1F, highest: S4F S4G U1D U1E U1F
--------------------------------------------------------------------------------
RFA_18
n missing distinct
7709 2291 108
lowest : A1B A1C A1D A1E A1F, highest: S4G U1D U1E U1F U1G
--------------------------------------------------------------------------------
RFA_19
n missing distinct
7461 2539 101
lowest : A1A A1B A1C A1D A1E, highest: S4F S4G U1D U1E U1F
--------------------------------------------------------------------------------
RFA_20
n missing distinct
4753 5247 67
lowest : A1C A1D A1E A1F A1G, highest: S4C S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_21
n missing distinct
6342 3658 96
lowest : A1B A1C A1D A1E A1F, highest: S4C S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_22
n missing distinct
7304 2696 108
lowest : A1B A1C A1D A1E A1F, highest: S4F S4G U1E U1F U1G
--------------------------------------------------------------------------------
RFA_23
n missing distinct
4122 5878 58
lowest : 2F A1E A1F A1G A2C, highest: S4C S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_24
n missing distinct
6186 3814 84
lowest : A1D A1E A1F A1G A2C, highest: S4F S4G U1D U1E U1F
--------------------------------------------------------------------------------
CARDPROM
n missing distinct Info Mean Gmd .05 .10
10000 0 47 0.999 18.4 9.845 5 6
.25 .50 .75 .90 .95
11 18 25 30 32
lowest : 2 3 4 5 6, highest: 44 45 46 47 50
--------------------------------------------------------------------------------
MAXADATE
n missing distinct Info Mean Gmd
10000 0 3 0.012 9702 0.7184
Value 9611 9612 9702
Frequency 6 34 9960
Proportion 0.001 0.003 0.996
--------------------------------------------------------------------------------
NUMPROM
n missing distinct Info Mean Gmd .05 .10
10000 0 134 1 46.91 25.9 14 16
.25 .50 .75 .90 .95
27 47 64 75 82
lowest : 5 6 7 8 9, highest: 137 142 148 163 170
--------------------------------------------------------------------------------
CARDPM12
n missing distinct Info Mean Gmd .05 .10
10000 0 15 0.836 5.335 1.17 3 4
.25 .50 .75 .90 .95
5 6 6 6 6
lowest : 0 1 2 3 4, highest: 10 11 12 14 15
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 4 29 153 421 1730 1980 5319 156 87 44 23
Proportion 0.000 0.003 0.015 0.042 0.173 0.198 0.532 0.016 0.009 0.004 0.002
Value 11 12 14 15
Frequency 49 3 1 1
Proportion 0.005 0.000 0.000 0.000
--------------------------------------------------------------------------------
NUMPRM12
n missing distinct Info Mean Gmd .05 .10
10000 0 48 0.971 12.83 3.825 8 9
.25 .50 .75 .90 .95
11 12 13 15 22
lowest : 1 2 3 4 5, highest: 45 46 48 50 53
--------------------------------------------------------------------------------
RDATE_3
n missing distinct Info Mean Gmd
18 9982 5 0.902 9634 47.48
lowest : 9606 9607 9610 9704 9806, highest: 9606 9607 9610 9704 9806
Value 9606 9607 9610 9704 9806
Frequency 6 7 1 3 1
Proportion 0.333 0.389 0.056 0.167 0.056
--------------------------------------------------------------------------------
RDATE_4
n missing distinct Info Mean Gmd
20 9980 9 0.968 9611 30.12
lowest : 9512 9601 9602 9604 9605, highest: 9605 9606 9609 9610 9704
Value 9512 9601 9602 9604 9605 9606 9609 9610 9704
Frequency 1 1 2 2 6 2 3 1 2
Proportion 0.05 0.05 0.10 0.10 0.30 0.10 0.15 0.05 0.10
--------------------------------------------------------------------------------
RDATE_5
n missing distinct Info Mean Gmd
2 9998 2 1 9654 99
Value 9604 9703
Frequency 1 1
Proportion 0.5 0.5
--------------------------------------------------------------------------------
RDATE_6
n missing distinct Info Mean Gmd
93 9907 7 0.41 9592 19.46
lowest : 9510 9511 9512 9603 9604, highest: 9512 9603 9604 9605 9606
Value 9510 9511 9512 9603 9604 9605 9606
Frequency 1 4 6 78 2 1 1
Proportion 0.011 0.043 0.065 0.839 0.022 0.011 0.011
--------------------------------------------------------------------------------
RDATE_7
n missing distinct Info Mean Gmd
945 9055 3 0.565 9603 0.3794
Value 9601 9602 9603
Frequency 1 237 707
Proportion 0.001 0.251 0.748
--------------------------------------------------------------------------------
RDATE_8
n missing distinct Info Mean Gmd
2243 7757 7 0.814 9601 1.15
lowest : 9511 9512 9601 9602 9603, highest: 9601 9602 9603 9605 9606
Value 9511 9512 9601 9602 9603 9605 9606
Frequency 1 5 1051 975 209 1 1
Proportion 0.000 0.002 0.469 0.435 0.093 0.000 0.000
--------------------------------------------------------------------------------
RDATE_9
n missing distinct Info Mean Gmd
1763 8237 7 0.493 9524 21.48
lowest : 9509 9510 9511 9512 9601, highest: 9511 9512 9601 9602 9603
Value 9509 9510 9511 9512 9601 9602 9603
Frequency 7 68 40 1405 178 38 27
Proportion 0.004 0.039 0.023 0.797 0.101 0.022 0.015
--------------------------------------------------------------------------------
RDATE_10
n missing distinct Info Mean Gmd
1094 8906 5 0.807 9521 16.94
lowest : 9511 9512 9601 9602 9603, highest: 9511 9512 9601 9602 9603
Value 9511 9512 9601 9602 9603
Frequency 564 418 81 22 9
Proportion 0.516 0.382 0.074 0.020 0.008
--------------------------------------------------------------------------------
RDATE_11
n missing distinct Info Mean Gmd
1523 8477 8 0.862 9517 11.87
lowest : 9509 9510 9511 9512 9601, highest: 9512 9601 9602 9603 9605
Value 9509 9510 9511 9512 9601 9602 9603 9605
Frequency 6 451 725 240 67 19 14 1
Proportion 0.004 0.296 0.476 0.158 0.044 0.012 0.009 0.001
--------------------------------------------------------------------------------
RDATE_12
n missing distinct Info Mean Gmd
2594 7406 7 0.84 9514 9.442
lowest : 9509 9510 9511 9512 9601, highest: 9511 9512 9601 9602 9603
Value 9509 9510 9511 9512 9601 9602 9603
Frequency 1357 638 207 265 87 28 12
Proportion 0.523 0.246 0.080 0.102 0.034 0.011 0.005
--------------------------------------------------------------------------------
RDATE_13
n missing distinct Info Mean Gmd .05 .10
1271 8729 12 0.772 9509 3.194 9507 9507
.25 .50 .75 .90 .95
9508 9508 9508 9509 9510
lowest : 9502 9503 9504 9507 9508, highest: 9511 9512 9601 9602 9603
Value 9502 9503 9504 9507 9508 9509 9510 9511 9512 9601 9602
Frequency 4 4 1 266 765 134 46 23 12 8 3
Proportion 0.003 0.003 0.001 0.209 0.602 0.105 0.036 0.018 0.009 0.006 0.002
Value 9603
Frequency 5
Proportion 0.004
--------------------------------------------------------------------------------
RDATE_14
n missing distinct Info Mean Gmd .05 .10
2354 7646 12 0.781 9508 3.229 9506 9506
.25 .50 .75 .90 .95
9506 9506 9507 9508 9510
lowest : 9406 9505 9506 9507 9508, highest: 9511 9512 9601 9602 9603
Value 9406 9505 9506 9507 9508 9509 9510 9511 9512 9601 9602
Frequency 1 1 1380 613 159 80 38 23 32 18 4
Proportion 0.000 0.000 0.586 0.260 0.068 0.034 0.016 0.010 0.014 0.008 0.002
Value 9603
Frequency 5
Proportion 0.002
--------------------------------------------------------------------------------
RDATE_15
n missing distinct Info Mean Gmd .05 .10
742 9258 13 0.689 9506 2.452 9505 9505
.25 .50 .75 .90 .95
9505 9505 9506 9507 9509
lowest : 9412 9502 9504 9505 9506, highest: 9510 9511 9512 9601 9603
Value 9412 9502 9504 9505 9506 9507 9508 9509 9510 9511 9512
Frequency 1 1 7 497 161 26 11 15 4 8 6
Proportion 0.001 0.001 0.009 0.670 0.217 0.035 0.015 0.020 0.005 0.011 0.008
Value 9601 9603
Frequency 4 1
Proportion 0.005 0.001
--------------------------------------------------------------------------------
RDATE_16
n missing distinct Info Mean Gmd .05 .10
2871 7129 14 0.757 9506 3.303 9504 9504
.25 .50 .75 .90 .95
9504 9504 9505 9506 9508
lowest : 9411 9503 9504 9505 9506, highest: 9511 9512 9601 9602 9603
Value 9411 9503 9504 9505 9506 9507 9508 9509 9510 9511 9512
Frequency 1 142 1769 589 163 59 36 30 19 10 20
Proportion 0.000 0.049 0.616 0.205 0.057 0.021 0.013 0.010 0.007 0.003 0.007
Value 9601 9602 9603
Frequency 22 7 4
Proportion 0.008 0.002 0.001
--------------------------------------------------------------------------------
RDATE_17
n missing distinct Info Mean Gmd .05 .10
935 9065 11 0.853 9503 1.34 9502 9502
.25 .50 .75 .90 .95
9502 9503 9503 9505 9506
lowest : 9502 9503 9504 9505 9506, highest: 9508 9509 9510 9511 9512
Value 9502 9503 9504 9505 9506 9507 9508 9509 9510 9511 9512
Frequency 262 465 106 35 27 11 8 7 2 2 10
Proportion 0.280 0.497 0.113 0.037 0.029 0.012 0.009 0.007 0.002 0.002 0.011
--------------------------------------------------------------------------------
RDATE_18
n missing distinct Info Mean Gmd .05 .10
1970 8030 14 0.833 9502 3.116 9501 9501
.25 .50 .75 .90 .95
9501 9501 9502 9505 9507
lowest : 9412 9501 9502 9503 9504, highest: 9509 9510 9511 9512 9601
Value 9412 9501 9502 9503 9504 9505 9506 9507 9508 9509 9510
Frequency 15 1050 491 147 52 43 54 58 20 15 8
Proportion 0.008 0.533 0.249 0.075 0.026 0.022 0.027 0.029 0.010 0.008 0.004
Value 9511 9512 9601
Frequency 8 8 1
Proportion 0.004 0.004 0.001
--------------------------------------------------------------------------------
RDATE_19
n missing distinct Info Mean Gmd .05 .10
1726 8274 13 0.5 9424 21.62 9411 9412
.25 .50 .75 .90 .95
9412 9412 9412 9501 9502
lowest : 9409 9410 9411 9412 9501, highest: 9505 9506 9507 9508 9509
Value 9409 9410 9411 9412 9501 9502 9503 9504 9505 9506 9507
Frequency 1 83 36 1369 137 43 21 9 5 9 4
Proportion 0.001 0.048 0.021 0.793 0.079 0.025 0.012 0.005 0.003 0.005 0.002
Value 9508 9509
Frequency 4 5
Proportion 0.002 0.003
--------------------------------------------------------------------------------
RDATE_20
n missing distinct Info Mean Gmd .05 .10
841 9159 10 0.768 9421 17.38 9411 9411
.25 .50 .75 .90 .95
9411 9412 9412 9501 9502
lowest : 9411 9412 9501 9502 9503, highest: 9504 9505 9506 9507 9508
Value 9411 9412 9501 9502 9503 9504 9505 9506 9507 9508
Frequency 259 494 42 20 11 4 5 2 2 2
Proportion 0.308 0.587 0.050 0.024 0.013 0.005 0.006 0.002 0.002 0.002
--------------------------------------------------------------------------------
RDATE_21
n missing distinct Info Mean Gmd .05 .10
962 9038 11 0.838 9418 14.09 9410 9410
.25 .50 .75 .90 .95
9410 9411 9411 9412 9502
lowest : 9410 9411 9412 9501 9502, highest: 9504 9505 9506 9507 9508
Value 9410 9411 9412 9501 9502 9503 9504 9505 9506 9507 9508
Frequency 246 502 137 25 33 2 4 4 3 3 3
Proportion 0.256 0.522 0.142 0.026 0.034 0.002 0.004 0.004 0.003 0.003 0.003
--------------------------------------------------------------------------------
RDATE_22
n missing distinct Info Mean Gmd .05 .10
2165 7835 12 0.834 9414 9.287 9409 9409
.25 .50 .75 .90 .95
9409 9409 9410 9412 9412
lowest : 9409 9410 9411 9412 9501, highest: 9504 9505 9506 9507 9508
Value 9409 9410 9411 9412 9501 9502 9503 9504 9505 9506 9507
Frequency 1150 531 193 188 45 26 9 6 5 6 1
Proportion 0.531 0.245 0.089 0.087 0.021 0.012 0.004 0.003 0.002 0.003 0.000
Value 9508
Frequency 5
Proportion 0.002
--------------------------------------------------------------------------------
RDATE_23
n missing distinct Info Mean Gmd .05 .10
778 9222 15 0.803 9410 4.442 9407 9407
.25 .50 .75 .90 .95
9408 9408 9408 9409 9410
lowest : 9403 9404 9407 9408 9409, highest: 9503 9504 9505 9506 9507
Value 9403 9404 9407 9408 9409 9410 9411 9412 9501 9502 9503
Frequency 1 1 184 441 86 31 14 5 2 4 2
Proportion 0.001 0.001 0.237 0.567 0.111 0.040 0.018 0.006 0.003 0.005 0.003
Value 9504 9505 9506 9507
Frequency 4 1 1 1
Proportion 0.005 0.001 0.001 0.001
--------------------------------------------------------------------------------
RDATE_24
n missing distinct Info Mean Gmd .05 .10
1846 8154 13 0.849 9408 3.772 9406 9406
.25 .50 .75 .90 .95
9406 9407 9407 9408 9410
lowest : 9309 9310 9406 9407 9408, highest: 9412 9501 9502 9503 9504
Value 9309 9310 9406 9407 9408 9409 9410 9411 9412 9501 9502
Frequency 2 2 716 833 122 68 42 15 23 6 9
Proportion 0.001 0.001 0.388 0.451 0.066 0.037 0.023 0.008 0.012 0.003 0.005
Value 9503 9504
Frequency 2 6
Proportion 0.001 0.003
--------------------------------------------------------------------------------
RAMNT_3
n missing distinct Info Mean Gmd .05 .10
18 9982 11 0.987 12.89 11.23 2.00 2.70
.25 .50 .75 .90 .95
7.00 8.50 18.50 21.50 28.75
lowest : 2 3 7 8 9, highest: 12 14 20 25 50
Value 2 3 7 8 9 10 12 14 20 25 50
Frequency 2 1 3 3 1 1 1 1 3 1 1
Proportion 0.111 0.056 0.167 0.167 0.056 0.056 0.056 0.056 0.167 0.056 0.056
--------------------------------------------------------------------------------
RAMNT_4
n missing distinct Info Mean Gmd .05 .10
20 9980 10 0.958 9.338 6.998 1.95 2.90
.25 .50 .75 .90 .95
5.00 9.50 11.25 17.09 20.25
lowest : 1.00 2.00 3.00 5.00 9.00, highest: 10.00 15.00 16.77 20.00 25.00
Value 1.00 2.00 3.00 5.00 9.00 10.00 15.00 16.77 20.00 25.00
Frequency 1 1 1 6 1 5 2 1 1 1
Proportion 0.05 0.05 0.05 0.30 0.05 0.25 0.10 0.05 0.05 0.05
--------------------------------------------------------------------------------
RAMNT_5
n missing distinct Info Mean Gmd
2 9998 2 1 31 38
Value 12 50
Frequency 1 1
Proportion 0.5 0.5
--------------------------------------------------------------------------------
RAMNT_6
n missing distinct Info Mean Gmd .05 .10
93 9907 28 0.988 13.28 8.339 5 5
.25 .50 .75 .90 .95
8 11 17 21 27
lowest : 1.00 2.00 2.86 4.00 5.00, highest: 27.00 30.00 35.00 40.00 50.00
--------------------------------------------------------------------------------
RAMNT_7
n missing distinct Info Mean Gmd .05 .10
945 9055 43 0.985 15.45 9.993 5 5
.25 .50 .75 .90 .95
10 15 20 25 30
lowest : 2 3 4 5 6, highest: 52 60 103 130 250
--------------------------------------------------------------------------------
RAMNT_8
n missing distinct Info Mean Gmd .05 .10
2243 7757 51 0.986 15.66 9.559 5.0 6.0
.25 .50 .75 .90 .95
10.0 15.0 20.0 25.0 26.9
lowest : 2.00 3.00 4.00 5.00 5.32, highest: 100.00 103.00 150.00 275.00 500.00
--------------------------------------------------------------------------------
RAMNT_9
n missing distinct Info Mean Gmd .05 .10
1763 8237 45 0.988 14.99 8.75 5 5
.25 .50 .75 .90 .95
10 15 20 25 27
lowest : 2 3 4 5 6, highest: 52 55 60 100 102
--------------------------------------------------------------------------------
RAMNT_10
n missing distinct Info Mean Gmd .05 .10
1094 8906 44 0.989 15.31 9.906 5 5
.25 .50 .75 .90 .95
10 14 20 25 30
lowest : 1.00 2.00 3.00 4.00 4.73, highest: 46.00 50.00 60.00 75.00 100.00
--------------------------------------------------------------------------------
RAMNT_11
n missing distinct Info Mean Gmd .05 .10
1523 8477 48 0.988 14.17 9.363 4.0 5.0
.25 .50 .75 .90 .95
8.0 12.0 20.0 25.0 26.9
lowest : 1 2 3 4 5, highest: 50 60 100 102 150
--------------------------------------------------------------------------------
RAMNT_12
n missing distinct Info Mean Gmd .05 .10
2594 7406 54 0.988 14.81 9.071 5 5
.25 .50 .75 .90 .95
10 13 20 25 28
lowest : 1.00 2.00 3.00 4.00 5.00, highest: 70.00 75.00 88.14 100.00 250.00
--------------------------------------------------------------------------------
RAMNT_13
n missing distinct Info Mean Gmd .05 .10
1271 8729 46 0.988 13.28 8.956 4 5
.25 .50 .75 .90 .95
7 10 16 25 29
lowest : 1 2 3 4 5, highest: 50 52 53 81 100
--------------------------------------------------------------------------------
RAMNT_14
n missing distinct Info Mean Gmd .05 .10
2354 7646 46 0.984 13.24 8.594 5 5
.25 .50 .75 .90 .95
7 10 16 25 25
lowest : 2 3 4 5 6, highest: 50 55 72 75 100
--------------------------------------------------------------------------------
RAMNT_15
n missing distinct Info Mean Gmd .05 .10
742 9258 41 0.989 12.88 8.841 4 5
.25 .50 .75 .90 .95
7 10 16 25 30
lowest : 2 3 4 5 6, highest: 44 45 50 51 75
--------------------------------------------------------------------------------
RAMNT_16
n missing distinct Info Mean Gmd .05 .10
2871 7129 50 0.991 13.81 8.422 5 5
.25 .50 .75 .90 .95
9 12 17 25 27
lowest : 1 2 3 4 5, highest: 60 63 75 100 200
--------------------------------------------------------------------------------
RAMNT_17
n missing distinct Info Mean Gmd .05 .10
935 9065 41 0.98 12.75 8.437 5.0 5.0
.25 .50 .75 .90 .95
7.0 10.0 15.0 24.6 25.0
lowest : 2 3 4 5 6, highest: 42 50 65 75 100
--------------------------------------------------------------------------------
RAMNT_18
n missing distinct Info Mean Gmd .05 .10
1970 8030 45 0.982 12.18 8.016 4 5
.25 .50 .75 .90 .95
7 10 15 20 25
lowest : 2 3 4 5 6, highest: 45 50 60 100 200
--------------------------------------------------------------------------------
RAMNT_19
n missing distinct Info Mean Gmd .05 .10
1726 8274 43 0.987 12.74 7.957 5 5
.25 .50 .75 .90 .95
7 10 15 20 25
lowest : 1 2 3 4 5, highest: 52 60 77 100 500
--------------------------------------------------------------------------------
RAMNT_20
n missing distinct Info Mean Gmd .05 .10
841 9159 43 0.991 14.82 9.512 5 5
.25 .50 .75 .90 .95
10 12 20 25 30
lowest : 2 3 4 5 6, highest: 46 47 50 60 100
--------------------------------------------------------------------------------
RAMNT_21
n missing distinct Info Mean Gmd .05 .10
962 9038 42 0.981 12.48 8.283 4 5
.25 .50 .75 .90 .95
7 10 15 21 25
lowest : 1 2 3 4 5, highest: 48 50 56 100 120
--------------------------------------------------------------------------------
RAMNT_22
n missing distinct Info Mean Gmd .05 .10
2165 7835 47 0.986 12.11 7.66 3 5
.25 .50 .75 .90 .95
7 10 15 20 25
lowest : 1 2 3 4 5, highest: 47 50 55 75 100
--------------------------------------------------------------------------------
RAMNT_23
n missing distinct Info Mean Gmd .05 .10
778 9222 38 0.987 12.03 7.909 3 5
.25 .50 .75 .90 .95
6 10 15 20 25
lowest : 2 3 4 5 6, highest: 40 45 50 60 125
--------------------------------------------------------------------------------
RAMNT_24
n missing distinct Info Mean Gmd .05 .10
1846 8154 43 0.98 11.2 7.282 3 5
.25 .50 .75 .90 .95
5 10 15 20 25
lowest : 1 2 3 4 5, highest: 48 50 51 60 100
--------------------------------------------------------------------------------
RAMNTALL
n missing distinct Info Mean Gmd .05 .10
10000 0 662 1 102.3 88.56 20.0 25.0
.25 .50 .75 .90 .95
40.0 78.0 130.0 202.1 270.0
lowest : 13 15 16 17 18, highest: 1405 1435 1655 1725 2038
--------------------------------------------------------------------------------
NGIFTALL
n missing distinct Info Mean Gmd .05 .10
10000 0 63 0.996 9.546 8.648 1 1
.25 .50 .75 .90 .95
3 7 13 20 26
lowest : 1 2 3 4 5, highest: 69 71 84 91 95
--------------------------------------------------------------------------------
CARDGIFT
n missing distinct Info Mean Gmd .05 .10
10000 0 30 0.988 5 4.709 0 1
.25 .50 .75 .90 .95
2 4 7 11 14
lowest : 0 1 2 3 4, highest: 25 26 27 29 32
--------------------------------------------------------------------------------
MINRAMNT
n missing distinct Info Mean Gmd .05 .10
10000 0 65 0.94 7.876 6.562 2 3
.25 .50 .75 .90 .95
3 5 10 20 20
lowest : 0.00 0.01 0.02 0.05 0.06, highest: 60.00 88.14 100.00 103.00 150.00
--------------------------------------------------------------------------------
MINRDATE
n missing distinct Info Mean Gmd .05 .10
10000 0 132 1 9250 302.6 8712 8810
.25 .50 .75 .90 .95
9012 9309 9503 9512 9602
lowest : 7506 7712 8012 8402 8607, highest: 9610 9611 9612 9701 9702
--------------------------------------------------------------------------------
MAXRAMNT
n missing distinct Info Mean Gmd .05 .10
10000 0 98 0.982 19.71 10.51 8 10
.25 .50 .75 .90 .95
14 17 22 29 37
lowest : 5.00 5.25 6.00 7.00 8.00, highest: 156.00 200.00 250.00 275.00 500.00
--------------------------------------------------------------------------------
MAXRDATE
n missing distinct Info Mean Gmd .05 .10
10000 0 128 0.997 9441 161.1 9012 9206
.25 .50 .75 .90 .95
9409 9506 9512 9602 9602
lowest : 8411 8504 8602 8608 8609, highest: 9610 9611 9612 9701 9702
--------------------------------------------------------------------------------
LASTGIFT
n missing distinct Info Mean Gmd .05 .10
10000 0 82 0.984 17.16 9.873 5 7
.25 .50 .75 .90 .95
10 15 20 25 30
lowest : 0 1 2 3 4, highest: 130 150 250 275 500
--------------------------------------------------------------------------------
LASTDATE
n missing distinct Info Mean Gmd .05 .10
10000 0 24 0.984 9548 49.2 9505 9506
.25 .50 .75 .90 .95
9509 9512 9602 9603 9603
lowest : 9503 9504 9505 9506 9507, highest: 9610 9611 9612 9701 9702
--------------------------------------------------------------------------------
FISTDATE
n missing distinct Info Mean Gmd .05 .10
10000 0 132 1 9135 362.8 8610 8701
.25 .50 .75 .90 .95
8810 9201 9409 9509 9601
lowest : 7211 7712 7804 7903 8007, highest: 9511 9512 9601 9602 9603
--------------------------------------------------------------------------------
NEXTDATE
n missing distinct Info Mean Gmd .05 .10
8963 1037 140 1 9150 337.2 8703 8707
.25 .50 .75 .90 .95
8903 9204 9409 9508 9512
lowest : 7211 7804 7810 8005 8011, highest: 9608 9609 9610 9612 9702
--------------------------------------------------------------------------------
TIMELAG
n missing distinct Info Mean Gmd .05 .10
8963 1037 51 0.995 8.04 6.26 1 2
.25 .50 .75 .90 .95
4 6 11 15 20
lowest : 0 1 2 3 4, highest: 48 49 56 62 90
--------------------------------------------------------------------------------
AVGGIFT
n missing distinct Info Mean Gmd .05 .10
10000 0 2266 1 13.2 7.426 5.000 6.066
.25 .50 .75 .90 .95
8.400 11.615 15.337 21.000 25.000
lowest : 1.574324 2.069767 2.071429 2.125000 2.205882
highest: 103.000000 150.000000 191.666667 200.000000 215.625000
--------------------------------------------------------------------------------
CONTROLN
n missing distinct Info Mean Gmd .05 .10
10000 0 10000 1 95917 63825 10222 19677
.25 .50 .75 .90 .95
48131 95716 144420 172345 182048
lowest : 27 65 94 103 107, highest: 191649 191673 191716 191754 191758
--------------------------------------------------------------------------------
TARGET_B
n missing distinct Info Sum Mean Gmd
10000 0 2 0.145 509 0.0509 0.09663
--------------------------------------------------------------------------------
TARGET_D
n missing distinct Info Mean Gmd .05 .10
10000 0 44 0.145 0.7419 1.432 0 0
.25 .50 .75 .90 .95
0 0 0 0 3
lowest : 0.0 1.0 2.0 2.5 3.0, highest: 45.0 50.0 51.0 60.0 100.0
--------------------------------------------------------------------------------
HPHONE_D
n missing distinct Info Sum Mean Gmd
10000 0 2 0.75 5118 0.5118 0.4998
--------------------------------------------------------------------------------
RFA_2R
n missing distinct value
10000 0 1 L
Value L
Frequency 10000
Proportion 1
--------------------------------------------------------------------------------
RFA_2F
n missing distinct Info Mean Gmd
10000 0 4 0.855 1.889 1.109
Value 1 2 3 4
Frequency 5056 2186 1569 1189
Proportion 0.506 0.219 0.157 0.119
--------------------------------------------------------------------------------
RFA_2A
n missing distinct
10000 0 4
Value D E F G
Frequency 738 2297 4980 1985
Proportion 0.074 0.230 0.498 0.198
--------------------------------------------------------------------------------
MDMAUD_R
n missing distinct
10000 0 5
lowest : C D I L X, highest: C D I L X
Value C D I L X
Frequency 7 4 5 4 9980
Proportion 0.001 0.000 0.000 0.000 0.998
--------------------------------------------------------------------------------
MDMAUD_F
n missing distinct
10000 0 3
Value 1 2 X
Frequency 12 8 9980
Proportion 0.001 0.001 0.998
--------------------------------------------------------------------------------
MDMAUD_A
n missing distinct
10000 0 4
Value C L M X
Frequency 17 1 2 9980
Proportion 0.002 0.000 0.000 0.998
--------------------------------------------------------------------------------
CLUSTER2
n missing distinct Info Mean Gmd .05 .10
9989 11 62 1 31.59 21.53 4.0 5.0
.25 .50 .75 .90 .95
15.0 32.0 49.0 57.2 60.0
lowest : 1 2 3 4 5, highest: 58 59 60 61 62
--------------------------------------------------------------------------------
GEOCODE2
n missing distinct
9964 36 4
Value A B C D
Frequency 3602 3055 1614 1693
Proportion 0.362 0.307 0.162 0.170
--------------------------------------------------------------------------------
describe(data.val)
data.val
479 Variables 10000 Observations
--------------------------------------------------------------------------------
ODATEDW
n missing distinct Info Mean Gmd .05 .10
10000 0 18 0.987 9144 388.4 8601 8601
.25 .50 .75 .90 .95
8801 9201 9501 9601 9601
lowest : 8601 8610 8701 8801 8901, highest: 9410 9501 9506 9601 9701
Value 8601 8610 8701 8801 8901 8909 9001 9101 9102 9110 9201
Frequency 1441 2 392 724 579 1 798 897 1 1 817
Proportion 0.144 0.000 0.039 0.072 0.058 0.000 0.080 0.090 0.000 0.000 0.082
Value 9301 9401 9410 9501 9506 9601 9701
Frequency 424 1211 1 1680 1 1029 1
Proportion 0.042 0.121 0.000 0.168 0.000 0.103 0.000
--------------------------------------------------------------------------------
OSOURCE
n missing distinct
9901 99 542
lowest : AAA AAD ABC ABD ABW, highest: WWF WWM YKA YKD ZOY
--------------------------------------------------------------------------------
TCODE
n missing distinct Info Mean Gmd .05 .10
10000 0 20 0.895 35.93 69.44 0 0
.25 .50 .75 .90 .95
0 1 2 28 28
lowest : 0 1 2 3 4, highest: 980 1002 4002 14002 28028
Value 0 1 2 3 4 10 13 22 28 30 42
Frequency 4297 2671 1825 78 29 1 4 1 852 1 6
Proportion 0.430 0.267 0.182 0.008 0.003 0.000 0.000 0.000 0.085 0.000 0.001
Value 45 72 93 116 980 1002 4002 14002 28028
Frequency 2 9 1 2 6 207 4 1 3
Proportion 0.000 0.001 0.000 0.000 0.001 0.021 0.000 0.000 0.000
--------------------------------------------------------------------------------
STATE
n missing distinct
10000 0 53
lowest : AA AK AL AP AR, highest: VA WA WI WV WY
--------------------------------------------------------------------------------
ZIP
n missing distinct
10000 0 6277
lowest : 00778 01030- 01776- 01930 02215 , highest: 99755 99762 99801 99824 99833
--------------------------------------------------------------------------------
MAILCODE
n missing distinct value
138 9862 1 B
Value B
Frequency 138
Proportion 1
--------------------------------------------------------------------------------
PVASTATE
n missing distinct
145 9855 2
Value E P
Frequency 1 144
Proportion 0.007 0.993
--------------------------------------------------------------------------------
DOB
n missing distinct Info Mean Gmd .05 .10
10000 0 785 0.985 2703 2427 0 0
.25 .50 .75 .90 .95
201 2601 4501 5606 6201
lowest : 0 1 4 8 10, highest: 7512 7601 7908 9605 9611
--------------------------------------------------------------------------------
NOEXCH
n missing distinct
9996 4 3
Value 0 1 X
Frequency 9969 25 2
Proportion 0.997 0.003 0.000
--------------------------------------------------------------------------------
RECINHSE
n missing distinct value
748 9252 1 X
Value X
Frequency 748
Proportion 1
--------------------------------------------------------------------------------
RECP3
n missing distinct value
218 9782 1 X
Value X
Frequency 218
Proportion 1
--------------------------------------------------------------------------------
RECPGVG
n missing distinct value
15 9985 1 X
Value X
Frequency 15
Proportion 1
--------------------------------------------------------------------------------
RECSWEEP
n missing distinct value
164 9836 1 X
Value X
Frequency 164
Proportion 1
--------------------------------------------------------------------------------
MDMAUD
n missing distinct
10000 0 12
lowest : C1CM C1LM C2CM C2MM D1CM, highest: D5MM I1CM I2CM L1CM XXXX
Value C1CM C1LM C2CM C2MM D1CM D1MM D2CM D5MM I1CM I2CM L1CM
Frequency 4 2 2 1 2 1 2 1 4 2 7
Proportion 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.001
Value XXXX
Frequency 9972
Proportion 0.997
--------------------------------------------------------------------------------
DOMAIN
n missing distinct
9752 248 16
lowest : C1 C2 C3 R1 R2, highest: T3 U1 U2 U3 U4
Value C1 C2 C3 R1 R2 R3 S1 S2 S3 T1 T2
Frequency 692 845 533 135 1434 486 1190 898 182 541 1252
Proportion 0.071 0.087 0.055 0.014 0.147 0.050 0.122 0.092 0.019 0.055 0.128
Value T3 U1 U2 U3 U4
Frequency 233 512 345 294 180
Proportion 0.024 0.053 0.035 0.030 0.018
--------------------------------------------------------------------------------
CLUSTER
n missing distinct Info Mean Gmd .05 .10
9752 248 53 0.999 27.71 16.74 3 8
.25 .50 .75 .90 .95
15 27 40 47 49
lowest : 1 2 3 4 5, highest: 49 50 51 52 53
--------------------------------------------------------------------------------
AGE
n missing distinct Info Mean Gmd .05 .10
7516 2484 80 1 61.85 19.14 34 39
.25 .50 .75 .90 .95
48 63 76 83 87
lowest : 1 2 18 22 23, highest: 94 95 96 97 98
--------------------------------------------------------------------------------
AGEFLAG
n missing distinct
6827 3173 2
Value E I
Frequency 5949 878
Proportion 0.871 0.129
--------------------------------------------------------------------------------
HOMEOWNR
n missing distinct
7654 2346 2
Value H U
Frequency 5480 2174
Proportion 0.716 0.284
--------------------------------------------------------------------------------
CHILD03
n missing distinct
147 9853 3
Value B F M
Frequency 7 39 101
Proportion 0.048 0.265 0.687
--------------------------------------------------------------------------------
CHILD07
n missing distinct
135 9865 3
Value B F M
Frequency 7 38 90
Proportion 0.052 0.281 0.667
--------------------------------------------------------------------------------
CHILD12
n missing distinct
181 9819 3
Value B F M
Frequency 12 44 125
Proportion 0.066 0.243 0.691
--------------------------------------------------------------------------------
CHILD18
n missing distinct
290 9710 3
Value B F M
Frequency 35 109 146
Proportion 0.121 0.376 0.503
--------------------------------------------------------------------------------
NUMCHLD
n missing distinct Info Mean Gmd
1238 8762 5 0.732 1.519 0.7378
lowest : 1 2 3 4 5, highest: 1 2 3 4 5
Value 1 2 3 4 5
Frequency 781 312 111 27 7
Proportion 0.631 0.252 0.090 0.022 0.006
--------------------------------------------------------------------------------
INCOME
n missing distinct Info Mean Gmd
7769 2231 7 0.974 3.909 2.101
lowest : 1 2 3 4 5, highest: 3 4 5 6 7
Value 1 2 3 4 5 6 7
Frequency 912 1364 906 1308 1672 831 776
Proportion 0.117 0.176 0.117 0.168 0.215 0.107 0.100
--------------------------------------------------------------------------------
GENDER
n missing distinct
9701 299 4
Value F J M U
Frequency 5399 52 4061 189
Proportion 0.557 0.005 0.419 0.019
--------------------------------------------------------------------------------
WEALTH1
n missing distinct Info Mean Gmd .05 .10
5321 4679 10 0.987 5.385 3.133 1 1
.25 .50 .75 .90 .95
3 6 8 9 9
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 264 345 394 500 478 519 598 667 745 811
Proportion 0.050 0.065 0.074 0.094 0.090 0.098 0.112 0.125 0.140 0.152
--------------------------------------------------------------------------------
HIT
n missing distinct Info Mean Gmd .05 .10
10000 0 59 0.802 3.24 5.23 0 0
.25 .50 .75 .90 .95
0 0 3 11 17
lowest : 0 1 2 3 4, highest: 54 55 59 63 240
--------------------------------------------------------------------------------
MBCRAFT
n missing distinct Info Mean Gmd
4484 5516 5 0.306 0.1521 0.2759
lowest : 0 1 2 3 4, highest: 0 1 2 3 4
Value 0 1 2 3 4
Frequency 3968 383 106 21 6
Proportion 0.885 0.085 0.024 0.005 0.001
--------------------------------------------------------------------------------
MBGARDEN
n missing distinct Info Mean Gmd
4484 5516 4 0.152 0.05955 0.1133
Value 0 1 2 3
Frequency 4244 214 25 1
Proportion 0.946 0.048 0.006 0.000
--------------------------------------------------------------------------------
MBBOOKS
n missing distinct Info Mean Gmd .05 .10
4484 5516 10 0.86 1.101 1.506 0 0
.25 .50 .75 .90 .95
0 1 2 3 5
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 2235 1105 496 278 138 92 52 41 26 21
Proportion 0.498 0.246 0.111 0.062 0.031 0.021 0.012 0.009 0.006 0.005
--------------------------------------------------------------------------------
MBCOLECT
n missing distinct Info Mean Gmd
4482 5518 6 0.137 0.05935 0.1139
lowest : 0 1 2 3 4, highest: 1 2 3 4 5
Value 0 1 2 3 4 5
Frequency 4267 173 36 4 1 1
Proportion 0.952 0.039 0.008 0.001 0.000 0.000
--------------------------------------------------------------------------------
MAGFAML
n missing distinct Info Mean Gmd
4484 5516 7 0.646 0.4509 0.7003
lowest : 0 1 2 3 4, highest: 2 3 4 5 6
Value 0 1 2 3 4 5 6
Frequency 3150 863 316 110 30 13 2
Proportion 0.702 0.192 0.070 0.025 0.007 0.003 0.000
--------------------------------------------------------------------------------
MAGFEM
n missing distinct Info Mean Gmd
4484 5516 4 0.307 0.128 0.2291
Value 0 1 2 3
Frequency 3966 464 52 2
Proportion 0.884 0.103 0.012 0.000
--------------------------------------------------------------------------------
MAGMALE
n missing distinct Info Mean Gmd
4484 5516 4 0.159 0.06178 0.1173
Value 0 1 2 3
Frequency 4233 227 22 2
Proportion 0.944 0.051 0.005 0.000
--------------------------------------------------------------------------------
PUBGARDN
n missing distinct Info Mean Gmd
4484 5516 5 0.273 0.1405 0.2588
lowest : 0 1 2 3 4, highest: 0 1 2 3 4
Value 0 1 2 3 4
Frequency 4032 318 98 28 8
Proportion 0.899 0.071 0.022 0.006 0.002
--------------------------------------------------------------------------------
PUBCULIN
n missing distinct Info Mean Gmd
4484 5516 5 0.325 0.1476 0.264
lowest : 0 1 2 3 4, highest: 0 1 2 3 4
Value 0 1 2 3 4
Frequency 3931 457 85 9 2
Proportion 0.877 0.102 0.019 0.002 0.000
--------------------------------------------------------------------------------
PUBHLTH
n missing distinct Info Mean Gmd .05 .10
4484 5516 10 0.732 0.7268 1.105 0 0
.25 .50 .75 .90 .95
0 0 1 2 4
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 2865 833 344 201 132 66 31 8 1 3
Proportion 0.639 0.186 0.077 0.045 0.029 0.015 0.007 0.002 0.000 0.001
--------------------------------------------------------------------------------
PUBDOITY
n missing distinct Info Mean Gmd
4484 5516 9 0.359 0.2366 0.4284
lowest : 0 1 2 3 4, highest: 4 5 6 7 9
Value 0 1 2 3 4 5 6 7 9
Frequency 3865 387 130 45 26 17 10 3 1
Proportion 0.862 0.086 0.029 0.010 0.006 0.004 0.002 0.001 0.000
--------------------------------------------------------------------------------
PUBNEWFN
n missing distinct Info Mean Gmd .05 .10
4484 5516 10 0.502 0.3776 0.6479 0 0
.25 .50 .75 .90 .95
0 0 0 1 2
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 3550 526 233 90 42 22 7 5 5 4
Proportion 0.792 0.117 0.052 0.020 0.009 0.005 0.002 0.001 0.001 0.001
--------------------------------------------------------------------------------
PUBPHOTO
n missing distinct Info Sum Mean Gmd
4484 5516 2 0.015 22 0.004906 0.009767
--------------------------------------------------------------------------------
PUBOPP
n missing distinct Info Mean Gmd
4484 5516 9 0.333 0.2252 0.4127
lowest : 0 1 2 3 4, highest: 4 5 6 8 9
Value 0 1 2 3 4 5 6 8 9
Frequency 3916 357 110 54 17 10 8 3 9
Proportion 0.873 0.080 0.025 0.012 0.004 0.002 0.002 0.001 0.002
--------------------------------------------------------------------------------
DATASRCE
n missing distinct Info Mean Gmd
7770 2230 3 0.766 2.488 0.6629
Value 1 2 3
Frequency 766 2449 4555
Proportion 0.099 0.315 0.586
--------------------------------------------------------------------------------
MALEMILI
n missing distinct Info Mean Gmd .05 .10
10000 0 60 0.525 1.005 1.845 0 0
.25 .50 .75 .90 .95
0 0 0 2 4
lowest : 0 1 2 3 4, highest: 90 93 94 97 99
--------------------------------------------------------------------------------
MALEVET
n missing distinct Info Mean Gmd .05 .10
10000 0 81 0.999 30.52 12.56 10 17
.25 .50 .75 .90 .95
25 31 37 44 49
lowest : 0 1 2 3 4, highest: 83 85 86 97 99
--------------------------------------------------------------------------------
VIETVETS
n missing distinct Info Mean Gmd .05 .10
10000 0 89 0.999 29.68 16.82 0 10
.25 .50 .75 .90 .95
20 29 39 49 55
lowest : 0 1 2 3 4, highest: 85 87 89 91 99
--------------------------------------------------------------------------------
WWIIVETS
n missing distinct Info Mean Gmd .05 .10
10000 0 98 1 32.86 19.74 0 11
.25 .50 .75 .90 .95
21 32 43 56 64
lowest : 0 1 2 3 4, highest: 93 94 96 98 99
--------------------------------------------------------------------------------
LOCALGOV
n missing distinct Info Mean Gmd .05 .10
10000 0 39 0.993 6.822 4.565 0 2
.25 .50 .75 .90 .95
4 6 9 12 14
lowest : 0 1 2 3 4, highest: 34 35 37 43 60
--------------------------------------------------------------------------------
STATEGOV
n missing distinct Info Mean Gmd .05 .10
10000 0 46 0.988 4.569 4.748 0 0
.25 .50 .75 .90 .95
1 3 6 10 14
lowest : 0 1 2 3 4, highest: 41 46 47 51 99
--------------------------------------------------------------------------------
FEDGOV
n missing distinct Info Mean Gmd .05 .10
10000 0 47 0.976 3.127 3.502 0 0
.25 .50 .75 .90 .95
1 2 4 7 10
lowest : 0 1 2 3 4, highest: 46 49 54 67 99
--------------------------------------------------------------------------------
SOLP3
n missing distinct Info Mean Gmd
12 9988 3 0.839 4.333 5.939
Value 0 2 12
Frequency 6 2 4
Proportion 0.500 0.167 0.333
--------------------------------------------------------------------------------
SOLIH
n missing distinct Info Mean Gmd
687 9313 6 0.203 11.19 1.517
lowest : 0 1 2 3 4, highest: 1 2 3 4 12
Value 0 1 2 3 4 12
Frequency 29 7 10 2 2 637
Proportion 0.042 0.010 0.015 0.003 0.003 0.927
--------------------------------------------------------------------------------
MAJOR
n missing distinct value
28 9972 1 X
Value X
Frequency 28
Proportion 1
--------------------------------------------------------------------------------
WEALTH2
n missing distinct Info Mean Gmd .05 .10
5451 4549 10 0.989 4.951 3.215 0 1
.25 .50 .75 .90 .95
3 5 7 9 9
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 374 408 500 560 543 601 563 558 658 686
Proportion 0.069 0.075 0.092 0.103 0.100 0.110 0.103 0.102 0.121 0.126
--------------------------------------------------------------------------------
GEOCODE
n missing distinct Info Mean Gmd
1576 8424 7 0.967 6.978 5.12
lowest : 1 2 3 4 5, highest: 3 4 5 12 14
Value 1 2 3 4 5 12 14
Frequency 88 143 334 166 231 397 217
Proportion 0.056 0.091 0.212 0.105 0.147 0.252 0.138
--------------------------------------------------------------------------------
COLLECT1
n missing distinct value
490 9510 1 Y
Value Y
Frequency 490
Proportion 1
--------------------------------------------------------------------------------
VETERANS
n missing distinct value
1086 8914 1 Y
Value Y
Frequency 1086
Proportion 1
--------------------------------------------------------------------------------
BIBLE
n missing distinct value
892 9108 1 Y
Value Y
Frequency 892
Proportion 1
--------------------------------------------------------------------------------
CATLG
n missing distinct value
844 9156 1 Y
Value Y
Frequency 844
Proportion 1
--------------------------------------------------------------------------------
HOMEE
n missing distinct value
90 9910 1 Y
Value Y
Frequency 90
Proportion 1
--------------------------------------------------------------------------------
PETS
n missing distinct value
1453 8547 1 Y
Value Y
Frequency 1453
Proportion 1
--------------------------------------------------------------------------------
CDPLAY
n missing distinct value
1234 8766 1 Y
Value Y
Frequency 1234
Proportion 1
--------------------------------------------------------------------------------
STEREO
n missing distinct value
1294 8706 1 Y
Value Y
Frequency 1294
Proportion 1
--------------------------------------------------------------------------------
PCOWNERS
n missing distinct value
1058 8942 1 Y
Value Y
Frequency 1058
Proportion 1
--------------------------------------------------------------------------------
PHOTO
n missing distinct value
498 9502 1 Y
Value Y
Frequency 498
Proportion 1
--------------------------------------------------------------------------------
CRAFTS
n missing distinct value
784 9216 1 Y
Value Y
Frequency 784
Proportion 1
--------------------------------------------------------------------------------
FISHER
n missing distinct value
749 9251 1 Y
Value Y
Frequency 749
Proportion 1
--------------------------------------------------------------------------------
GARDENIN
n missing distinct value
1368 8632 1 Y
Value Y
Frequency 1368
Proportion 1
--------------------------------------------------------------------------------
BOATS
n missing distinct value
194 9806 1 Y
Value Y
Frequency 194
Proportion 1
--------------------------------------------------------------------------------
WALKER
n missing distinct value
1012 8988 1 Y
Value Y
Frequency 1012
Proportion 1
--------------------------------------------------------------------------------
KIDSTUFF
n missing distinct value
152 9848 1 Y
Value Y
Frequency 152
Proportion 1
--------------------------------------------------------------------------------
CARDS
n missing distinct value
103 9897 1 Y
Value Y
Frequency 103
Proportion 1
--------------------------------------------------------------------------------
PLATES
n missing distinct value
53 9947 1 Y
Value Y
Frequency 53
Proportion 1
--------------------------------------------------------------------------------
LIFESRC
n missing distinct Info Mean Gmd
4238 5762 3 0.851 2.034 0.7625
Value 1 2 3
Frequency 1016 2061 1161
Proportion 0.240 0.486 0.274
--------------------------------------------------------------------------------
PEPSTRFL
n missing distinct value
4697 5303 1 X
Value X
Frequency 4697
Proportion 1
--------------------------------------------------------------------------------
POP901
n missing distinct Info Mean Gmd .05 .10
10000 0 4333 1 3287 3767 544 706
.25 .50 .75 .90 .95
1003 1578 3058 6413 10917
lowest : 0 1 2 6 8, highest: 76571 79926 80680 85790 88442
--------------------------------------------------------------------------------
POP902
n missing distinct Info Mean Gmd .05 .10
10000 0 2279 1 874.2 997.4 140 185
.25 .50 .75 .90 .95
268 425 830 1759 2954
lowest : 0 1 2 3 4, highest: 18575 19094 20025 20621 21036
--------------------------------------------------------------------------------
POP903
n missing distinct Info Mean Gmd .05 .10
10000 0 2702 1 1240 1418 214 273
.25 .50 .75 .90 .95
382 590 1160 2410 4020
lowest : 0 1 2 3 4, highest: 27674 29306 30492 31198 33459
--------------------------------------------------------------------------------
POP90C1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.796 58.89 47.63 0 0
.25 .50 .75 .90 .95
0 99 99 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
POP90C2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.481 13.42 22.98 0 0
.25 .50 .75 .90 .95
0 0 0 82 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
POP90C3
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.782 26.17 37.92 0 0
.25 .50 .75 .90 .95
0 0 53 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
POP90C4
n missing distinct Info Mean Gmd .05 .10
10000 0 57 0.981 48.25 3.807 43 45
.25 .50 .75 .90 .95
47 49 50 51 52
lowest : 0 5 8 25 27, highest: 83 84 86 90 99
--------------------------------------------------------------------------------
POP90C5
n missing distinct Info Mean Gmd .05 .10
10000 0 57 0.981 51.06 3.847 48 49
.25 .50 .75 .90 .95
50 51 53 55 56
lowest : 0 10 14 16 17, highest: 73 75 92 95 99
--------------------------------------------------------------------------------
ETH1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.994 84.89 18.71 35 60
.25 .50 .75 .90 .95
82 93 98 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
ETH2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.958 7.531 11.9 0 0
.25 .50 .75 .90 .95
0 1 6 20 41
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
ETH3
n missing distinct Info Mean Gmd .05 .10
10000 0 44 0.698 0.7288 1.201 0 0
.25 .50 .75 .90 .95
0 0 1 1 2
lowest : 0 1 2 3 4, highest: 70 71 80 89 96
--------------------------------------------------------------------------------
ETH4
n missing distinct Info Mean Gmd .05 .10
10000 0 78 0.926 3.05 4.672 0 0
.25 .50 .75 .90 .95
0 1 3 7 13
lowest : 0 1 2 3 4, highest: 84 85 88 91 92
--------------------------------------------------------------------------------
ETH5
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.979 7.38 10.47 0 0
.25 .50 .75 .90 .95
1 2 7 19 34
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
ETH6
n missing distinct Info Mean Gmd .05 .10
10000 0 16 0.401 0.2296 0.4063 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 12 13 14 15 29
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 8421 1219 224 67 36 7 8 3 3 4 1
Proportion 0.842 0.122 0.022 0.007 0.004 0.001 0.001 0.000 0.000 0.000 0.000
Value 12 13 14 15 29
Frequency 2 1 1 2 1
Proportion 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
ETH7
n missing distinct Info Mean Gmd .05 .10
10000 0 37 0.437 0.4235 0.7757 0 0
.25 .50 .75 .90 .95
0 0 0 1 2
lowest : 0 1 2 3 4, highest: 45 47 49 50 65
--------------------------------------------------------------------------------
ETH8
n missing distinct Info Mean Gmd .05 .10
10000 0 42 0.538 0.6521 1.168 0 0
.25 .50 .75 .90 .95
0 0 0 1 3
lowest : 0 1 2 3 4, highest: 41 42 43 57 67
--------------------------------------------------------------------------------
ETH9
n missing distinct Info Mean Gmd .05 .10
10000 0 43 0.498 0.6095 1.106 0 0
.25 .50 .75 .90 .95
0 0 0 1 3
lowest : 0 1 2 3 4, highest: 54 58 59 61 67
--------------------------------------------------------------------------------
ETH10
n missing distinct Info Mean Gmd .05 .10
10000 0 19 0.366 0.2496 0.4542 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 20 21 23 24 45
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 8584 996 210 88 41 32 11 16 5 4 2
Proportion 0.858 0.100 0.021 0.009 0.004 0.003 0.001 0.002 0.000 0.000 0.000
Value 11 13 18 20 21 23 24 45
Frequency 3 1 1 1 2 1 1 1
Proportion 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
ETH11
n missing distinct Info Mean Gmd .05 .10
10000 0 22 0.294 0.2298 0.4308 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 19 21 22 24 45
--------------------------------------------------------------------------------
ETH12
n missing distinct Info Mean Gmd .05 .10
10000 0 22 0.043 0.0737 0.1466 0 0
.25 .50 .75 .90 .95
0 0 0 0 0
lowest : 0 1 2 4 5, highest: 24 25 26 48 54
--------------------------------------------------------------------------------
ETH13
n missing distinct Info Mean Gmd .05 .10
10000 0 96 0.941 5.024 7.821 0 0
.25 .50 .75 .90 .95
0 1 4 13 24
lowest : 0 1 2 3 4, highest: 91 92 93 94 96
--------------------------------------------------------------------------------
ETH14
n missing distinct Info Mean Gmd .05 .10
10000 0 25 0.421 0.3102 0.5579 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 26 28 29 31 36
--------------------------------------------------------------------------------
ETH15
n missing distinct Info Mean Gmd .05 .10
10000 0 49 0.219 0.3599 0.7031 0 0
.25 .50 .75 .90 .95
0 0 0 0 1
lowest : 0 1 2 3 4, highest: 67 68 69 70 77
--------------------------------------------------------------------------------
ETH16
n missing distinct Info Mean Gmd .05 .10
10000 0 43 0.884 1.503 2.162 0 0
.25 .50 .75 .90 .95
0 1 2 4 6
lowest : 0 1 2 3 4, highest: 40 42 43 44 61
--------------------------------------------------------------------------------
AGE901
n missing distinct Info Mean Gmd .05 .10
10000 0 66 0.996 34.72 8.055 26 28
.25 .50 .75 .90 .95
30 33 37 43 49
lowest : 0 15 17 18 19, highest: 76 77 78 79 84
--------------------------------------------------------------------------------
AGE902
n missing distinct Info Mean Gmd .05 .10
10000 0 63 0.997 42.13 8.149 33 35
.25 .50 .75 .90 .95
38 41 45 51 57
lowest : 0 19 20 21 22, highest: 76 77 78 79 84
--------------------------------------------------------------------------------
AGE903
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.997 45.27 8.034 36 38
.25 .50 .75 .90 .95
41 44 49 54 60
lowest : 0 29 30 31 32, highest: 77 78 79 80 84
--------------------------------------------------------------------------------
AGE904
n missing distinct Info Mean Gmd .05 .10
10000 0 62 0.996 36.11 7.127 28 29
.25 .50 .75 .90 .95
32 35 39 43 48
lowest : 0 18 19 20 21, highest: 74 75 76 77 78
--------------------------------------------------------------------------------
AGE905
n missing distinct Info Mean Gmd .05 .10
10000 0 57 0.996 44.87 6.564 37 39
.25 .50 .75 .90 .95
42 45 48 51 55
lowest : 0 23 24 25 26, highest: 74 75 76 77 78
--------------------------------------------------------------------------------
AGE906
n missing distinct Info Mean Gmd .05 .10
10000 0 48 0.996 48.04 6.338 40 42
.25 .50 .75 .90 .95
45 48 51 54 57
lowest : 0 32 33 34 35, highest: 74 75 76 77 78
--------------------------------------------------------------------------------
AGE907
n missing distinct Info Mean Gmd .05 .10
10000 0 52 0.998 24.43 8.094 10 15
.25 .50 .75 .90 .95
21 25 29 33 35
lowest : 0 1 2 3 4, highest: 47 48 50 51 55
--------------------------------------------------------------------------------
CHIL1
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.998 39.66 8.333 29 32
.25 .50 .75 .90 .95
35 39 44 49 52
lowest : 0 13 14 15 16, highest: 78 80 82 84 99
--------------------------------------------------------------------------------
CHIL2
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.994 38.41 5.688 30 33
.25 .50 .75 .90 .95
36 39 42 44 45
lowest : 0 6 10 11 13, highest: 60 61 67 75 99
--------------------------------------------------------------------------------
CHIL3
n missing distinct Info Mean Gmd .05 .10
10000 0 52 0.995 21.04 5.548 13 15
.25 .50 .75 .90 .95
18 21 24 26 28
lowest : 0 5 6 7 8, highest: 60 63 67 70 99
--------------------------------------------------------------------------------
AGEC1
n missing distinct Info Mean Gmd .05 .10
10000 0 73 0.993 12.04 5.339 5 7
.25 .50 .75 .90 .95
9 11 14 17 20
lowest : 0 1 2 3 4, highest: 76 77 83 89 90
--------------------------------------------------------------------------------
AGEC2
n missing distinct Info Mean Gmd .05 .10
10000 0 61 0.998 22.14 8.446 10 13
.25 .50 .75 .90 .95
18 22 26 32 36
lowest : 0 1 2 3 4, highest: 56 57 58 61 62
--------------------------------------------------------------------------------
AGEC3
n missing distinct Info Mean Gmd .05 .10
10000 0 52 0.996 20.7 6.591 12 14
.25 .50 .75 .90 .95
17 20 24 28 32
lowest : 0 1 2 3 4, highest: 48 50 55 58 99
--------------------------------------------------------------------------------
AGEC4
n missing distinct Info Mean Gmd .05 .10
10000 0 35 0.993 14.13 4.65 8 9
.25 .50 .75 .90 .95
12 14 16 19 21
lowest : 0 1 2 3 4, highest: 30 31 32 33 34
--------------------------------------------------------------------------------
AGEC5
n missing distinct Info Mean Gmd .05 .10
10000 0 36 0.993 11.91 4.507 5 7
.25 .50 .75 .90 .95
9 12 14 17 19
lowest : 0 1 2 3 4, highest: 31 32 34 36 99
--------------------------------------------------------------------------------
AGEC6
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.996 10.64 6.156 3 4
.25 .50 .75 .90 .95
7 10 13 17 21
lowest : 0 1 2 3 4, highest: 49 50 53 54 55
--------------------------------------------------------------------------------
AGEC7
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.995 7.765 6.466 1 2
.25 .50 .75 .90 .95
3 6 10 15 20
lowest : 0 1 2 3 4, highest: 65 69 72 78 83
--------------------------------------------------------------------------------
CHILC1
n missing distinct Info Mean Gmd .05 .10
10000 0 48 0.994 16.24 5.164 10 11
.25 .50 .75 .90 .95
13 16 19 22 25
lowest : 0 2 4 5 6, highest: 48 50 51 60 75
--------------------------------------------------------------------------------
CHILC2
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.988 16.15 3.625 11 13
.25 .50 .75 .90 .95
14 16 18 20 21
lowest : 0 3 4 5 6, highest: 40 50 60 75 99
--------------------------------------------------------------------------------
CHILC3
n missing distinct Info Mean Gmd .05 .10
10000 0 50 0.991 31.73 4.736 25 27
.25 .50 .75 .90 .95
30 32 34 36 38
lowest : 0 2 5 7 8, highest: 50 52 63 67 99
--------------------------------------------------------------------------------
CHILC4
n missing distinct Info Mean Gmd .05 .10
10000 0 45 0.993 19.74 4.583 13 15
.25 .50 .75 .90 .95
18 20 22 24 26
lowest : 0 1 3 4 5, highest: 44 50 52 60 75
--------------------------------------------------------------------------------
CHILC5
n missing distinct Info Mean Gmd .05 .10
10000 0 66 0.991 15.26 4.636 9 11
.25 .50 .75 .90 .95
13 15 17 19 21
lowest : 0 1 3 4 5, highest: 75 80 89 92 99
--------------------------------------------------------------------------------
HHAGE1
n missing distinct Info Mean Gmd .05 .10
10000 0 97 0.999 24.79 14.1 7 10
.25 .50 .75 .90 .95
16 24 32 39 47
lowest : 0 1 2 3 4, highest: 92 93 94 95 99
--------------------------------------------------------------------------------
HHAGE2
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.997 9.41 7.462 1 2
.25 .50 .75 .90 .95
4 8 12 17 22
lowest : 0 1 2 3 4, highest: 67 68 69 70 72
--------------------------------------------------------------------------------
HHAGE3
n missing distinct Info Mean Gmd .05 .10
10000 0 96 0.999 22.49 13.91 5 8
.25 .50 .75 .90 .95
14 21 29 37 44
lowest : 0 1 2 3 4, highest: 91 92 93 94 99
--------------------------------------------------------------------------------
HHN1
n missing distinct Info Mean Gmd .05 .10
10000 0 90 0.999 22.87 12.56 8 10
.25 .50 .75 .90 .95
15 21 28 37 46
lowest : 0 1 2 3 4, highest: 88 89 91 93 99
--------------------------------------------------------------------------------
HHN2
n missing distinct Info Mean Gmd .05 .10
10000 0 72 0.997 33.73 8.505 22 25
.25 .50 .75 .90 .95
30 33 38 43 47
lowest : 0 6 7 8 9, highest: 74 78 79 85 99
--------------------------------------------------------------------------------
HHN3
n missing distinct Info Mean Gmd .05 .10
10000 0 89 0.999 42.67 16.18 16 24
.25 .50 .75 .90 .95
35 44 52 60 65
lowest : 0 1 2 3 4, highest: 85 86 87 89 98
--------------------------------------------------------------------------------
HHN4
n missing distinct Info Mean Gmd .05 .10
10000 0 73 0.999 25.61 12.33 7 11
.25 .50 .75 .90 .95
19 26 32 39 44
lowest : 0 1 2 3 4, highest: 68 70 73 76 92
--------------------------------------------------------------------------------
HHN5
n missing distinct Info Mean Gmd .05 .10
10000 0 58 0.996 10.45 6.584 2 4
.25 .50 .75 .90 .95
7 10 13 17 22
lowest : 0 1 2 3 4, highest: 54 59 60 61 69
--------------------------------------------------------------------------------
HHN6
n missing distinct Info Mean Gmd .05 .10
10000 0 42 0.977 3.915 3.365 0.00 1.00
.25 .50 .75 .90 .95
2.00 3.00 5.00 7.00 10.05
lowest : 0 1 2 3 4, highest: 38 40 41 44 50
--------------------------------------------------------------------------------
MARR1
n missing distinct Info Mean Gmd .05 .10
10000 0 87 0.999 58.27 13.57 34 42
.25 .50 .75 .90 .95
52 61 67 71 74
lowest : 0 1 2 7 8, highest: 87 88 89 90 99
--------------------------------------------------------------------------------
MARR2
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.994 10.67 4.876 4 6
.25 .50 .75 .90 .95
8 10 13 17 19
lowest : 0 1 2 3 4, highest: 40 46 47 50 99
--------------------------------------------------------------------------------
MARR3
n missing distinct Info Mean Gmd .05 .10
10000 0 48 0.993 7.473 4.921 2 3
.25 .50 .75 .90 .95
4 7 9 13 16
lowest : 0 1 2 3 4, highest: 45 48 50 54 60
--------------------------------------------------------------------------------
MARR4
n missing distinct Info Mean Gmd .05 .10
10000 0 81 0.997 22.9 8.944 13 15
.25 .50 .75 .90 .95
18 21 26 34 39
lowest : 0 1 2 3 4, highest: 80 85 91 93 98
--------------------------------------------------------------------------------
HHP1
n missing distinct Info Mean Gmd .05 .10
10000 0 323 1 185.4 53.55 108 132
.25 .50 .75 .90 .95
159 181 211 248 272
lowest : 0 50 54 55 56, highest: 424 448 453 460 497
--------------------------------------------------------------------------------
HHP2
n missing distinct Info Mean Gmd .05 .10
10000 0 319 1 259.9 51.73 181 204
.25 .50 .75 .90 .95
236 262 287 314 331
lowest : 0 100 110 111 113, highest: 458 482 487 490 495
--------------------------------------------------------------------------------
DW1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 70.31 27.22 16 32
.25 .50 .75 .90 .95
58 76 90 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
DW2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 66.1 29.35 9 24
.25 .50 .75 .90 .95
51 72 87 97 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
DW3
n missing distinct Info Mean Gmd .05 .10
10000 0 57 0.934 2.86 4.087 0 0
.25 .50 .75 .90 .95
0 1 3 8 12
lowest : 0 1 2 3 4, highest: 58 59 61 62 63
--------------------------------------------------------------------------------
DW4
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.994 19.4 24.16 0 0
.25 .50 .75 .90 .95
1 9 30 57 74
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
DW5
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.981 16.52 22.07 0 0
.25 .50 .75 .90 .95
0 6 24 51 69
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
DW6
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.954 13.05 18.77 0 0
.25 .50 .75 .90 .95
0 3 18 42 60
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
DW7
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.655 1.783 3.134 0 0
.25 .50 .75 .90 .95
0 0 1 5 10
lowest : 0 1 2 3 4, highest: 75 78 87 90 99
--------------------------------------------------------------------------------
DW8
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.518 1.175 2.127 0 0
.25 .50 .75 .90 .95
0 0 0 3 7
lowest : 0 1 2 3 4, highest: 66 70 74 78 90
--------------------------------------------------------------------------------
DW9
n missing distinct Info Mean Gmd .05 .10
10000 0 51 0.331 0.6012 1.149 0 0
.25 .50 .75 .90 .95
0 0 0 1 2
lowest : 0 1 2 3 4, highest: 65 67 81 87 99
--------------------------------------------------------------------------------
HV1
n missing distinct Info Mean Gmd .05 .10
10000 0 2494 1 1080 873.8 313 376
.25 .50 .75 .90 .95
519 749 1271 2274 3021
lowest : 0 75 126 128 130, highest: 5965 5975 5985 5986 6000
--------------------------------------------------------------------------------
HV2
n missing distinct Info Mean Gmd .05 .10
10000 0 2622 1 1151 905.5 349.0 413.9
.25 .50 .75 .90 .95
559.0 816.0 1381.2 2397.2 3158.0
lowest : 0 125 162 163 168, highest: 5900 5934 5953 5955 6000
--------------------------------------------------------------------------------
HV3
n missing distinct Info Mean Gmd .05 .10
10000 0 14 0.973 4.281 2.477 2 2
.25 .50 .75 .90 .95
3 4 5 7 9
lowest : 0 1 2 3 4, highest: 9 10 11 12 13
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 87 321 1819 2257 1833 1284 900 527 328 258 116
Proportion 0.009 0.032 0.182 0.226 0.183 0.128 0.090 0.053 0.033 0.026 0.012
Value 11 12 13
Frequency 121 131 18
Proportion 0.012 0.013 0.002
--------------------------------------------------------------------------------
HV4
n missing distinct Info Mean Gmd .05 .10
10000 0 14 0.978 3.918 2.479 1 1
.25 .50 .75 .90 .95
2 4 5 7 8
lowest : 0 1 2 3 4, highest: 9 10 11 12 13
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 127 1032 1885 1941 1674 1161 836 529 353 229 140
Proportion 0.013 0.103 0.188 0.194 0.167 0.116 0.084 0.053 0.035 0.023 0.014
Value 11 12 13
Frequency 72 15 6
Proportion 0.007 0.002 0.001
--------------------------------------------------------------------------------
HU1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 70 23.08 23 38
.25 .50 .75 .90 .95
60 76 85 92 95
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HU2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 29.26 22.48 5 8
.25 .50 .75 .90 .95
14 24 39 60 74
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HU3
n missing distinct Info Mean Gmd .05 .10
10000 0 87 0.995 90.17 9.906 69 80
.25 .50 .75 .90 .95
89 94 97 98 99
lowest : 0 4 12 14 16, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HU4
n missing distinct Info Mean Gmd .05 .10
10000 0 88 0.995 9.105 8.753 1 2
.25 .50 .75 .90 .95
3 6 11 19 28
lowest : 0 1 2 3 4, highest: 84 86 88 96 99
--------------------------------------------------------------------------------
HU5
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.967 13.52 19.01 0 0
.25 .50 .75 .90 .95
0 4 16 47 65
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HHD1
n missing distinct Info Mean Gmd .05 .10
10000 0 87 0.999 35.59 14.4 11 19
.25 .50 .75 .90 .95
29 36 44 51 56
lowest : 0 1 2 3 4, highest: 83 84 87 88 96
--------------------------------------------------------------------------------
HHD2
n missing distinct Info Mean Gmd .05 .10
10000 0 94 0.999 71.63 15.37 43 53
.25 .50 .75 .90 .95
66 75 81 86 89
lowest : 0 4 7 9 10, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HHD3
n missing distinct Info Mean Gmd .05 .10
10000 0 94 1 58.95 17.75 28 37
.25 .50 .75 .90 .95
50 62 70 77 81
lowest : 0 1 4 5 6, highest: 92 93 94 95 99
--------------------------------------------------------------------------------
HHD4
n missing distinct Info Mean Gmd .05 .10
10000 0 81 0.999 27.8 13.36 7 12
.25 .50 .75 .90 .95
20 28 35 43 48
lowest : 0 1 2 3 4, highest: 78 79 80 81 91
--------------------------------------------------------------------------------
HHD5
n missing distinct Info Mean Gmd .05 .10
10000 0 91 0.998 82.07 12.63 56 68
.25 .50 .75 .90 .95
79 86 90 93 94
lowest : 0 2 5 7 8, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HHD6
n missing distinct Info Mean Gmd .05 .10
10000 0 92 0.998 17.24 11.73 5 7
.25 .50 .75 .90 .95
10 14 21 31 41
lowest : 0 1 2 3 4, highest: 92 93 95 98 99
--------------------------------------------------------------------------------
HHD7
n missing distinct Info Mean Gmd .05 .10
10000 0 49 0.993 7.793 5.134 2 3
.25 .50 .75 .90 .95
5 7 10 14 18
lowest : 0 1 2 3 4, highest: 44 45 46 47 48
--------------------------------------------------------------------------------
HHD8
n missing distinct Info Mean Gmd .05 .10
10000 0 11 0.893 1.614 1.107 0 0
.25 .50 .75 .90 .95
1 1 2 3 3
lowest : 0 1 2 3 4, highest: 6 7 8 9 10
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 1061 4095 3295 1050 307 119 43 18 8 3 1
Proportion 0.106 0.410 0.330 0.105 0.031 0.012 0.004 0.002 0.001 0.000 0.000
--------------------------------------------------------------------------------
HHD9
n missing distinct Info Mean Gmd .05 .10
10000 0 45 0.989 6.17 4.387 1 2
.25 .50 .75 .90 .95
3 5 8 11 15
lowest : 0 1 2 3 4, highest: 40 41 42 43 44
--------------------------------------------------------------------------------
HHD10
n missing distinct Info Mean Gmd .05 .10
10000 0 64 0.996 13.57 7.137 6 7
.25 .50 .75 .90 .95
9 12 16 22 27
lowest : 0 1 2 3 4, highest: 68 74 88 89 99
--------------------------------------------------------------------------------
HHD11
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.999 18.95 10.17 7 9
.25 .50 .75 .90 .95
12 18 24 31 36
lowest : 0 1 2 3 4, highest: 64 65 72 73 99
--------------------------------------------------------------------------------
HHD12
n missing distinct Info Mean Gmd .05 .10
10000 0 41 0.983 4.768 3.729 1 1
.25 .50 .75 .90 .95
2 4 6 9 12
lowest : 0 1 2 3 4, highest: 36 38 39 46 91
--------------------------------------------------------------------------------
ETHC1
n missing distinct Info Mean Gmd .05 .10
10000 0 42 0.998 16.71 7.709 3 7
.25 .50 .75 .90 .95
13 18 22 25 26
lowest : 0 1 2 3 4, highest: 37 38 39 41 42
--------------------------------------------------------------------------------
ETHC2
n missing distinct Info Mean Gmd .05 .10
10000 0 90 0.999 50.72 14.92 18 31
.25 .50 .75 .90 .95
46 54 60 64 66
lowest : 0 1 2 3 4, highest: 85 86 87 88 89
--------------------------------------------------------------------------------
ETHC3
n missing distinct Info Mean Gmd .05 .10
10000 0 96 0.999 17.51 12.44 3 5
.25 .50 .75 .90 .95
10 15 22 30 39
lowest : 0 1 2 3 4, highest: 93 94 95 96 99
--------------------------------------------------------------------------------
ETHC4
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.804 1.951 3.194 0 0
.25 .50 .75 .90 .95
0 0 1 6 12
lowest : 0 1 2 3 4, highest: 33 34 36 37 39
--------------------------------------------------------------------------------
ETHC5
n missing distinct Info Mean Gmd .05 .10
10000 0 70 0.927 4.596 7.256 0 0
.25 .50 .75 .90 .95
0 1 4 12 25
lowest : 0 1 2 3 4, highest: 65 66 67 68 70
--------------------------------------------------------------------------------
ETHC6
n missing distinct Info Mean Gmd .05 .10
10000 0 36 0.522 0.8818 1.592 0 0
.25 .50 .75 .90 .95
0 0 0 2 5
lowest : 0 1 2 3 4, highest: 31 32 34 36 54
--------------------------------------------------------------------------------
HVP1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.925 14.2 22.59 0 0
.25 .50 .75 .90 .95
0 1 11 65 88
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HVP2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.982 21.99 31.17 0 0
.25 .50 .75 .90 .95
1 4 32 88 96
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HVP3
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.998 35.99 40.28 0 1
.25 .50 .75 .90 .95
4 18 73 97 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HVP4
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.997 51.98 41.83 2 4
.25 .50 .75 .90 .95
15 50 93 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HVP5
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.976 74.45 30.99 15 26
.25 .50 .75 .90 .95
53 90 99 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HVP6
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.745 6.579 11.61 0 0
.25 .50 .75 .90 .95
0 0 2 18 50
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HUR1
n missing distinct Info Mean Gmd .05 .10
10000 0 78 0.982 4.932 6.285 0 0
.25 .50 .75 .90 .95
1 2 6 12 19
lowest : 0 1 2 3 4, highest: 81 83 88 89 90
--------------------------------------------------------------------------------
HUR2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 45.9 24.33 11 19
.25 .50 .75 .90 .95
32 44 60 76 86
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
RHP1
n missing distinct Info Mean Gmd .05 .10
10000 0 73 0.999 53.04 11.27 37 42
.25 .50 .75 .90 .95
48 52 58 66 71
lowest : 0 8 13 14 16, highest: 80 81 82 83 84
--------------------------------------------------------------------------------
RHP2
n missing distinct Info Mean Gmd .05 .10
10000 0 72 0.999 53.88 10.82 38 43
.25 .50 .75 .90 .95
49 54 59 66 70
lowest : 0 14 16 17 18, highest: 82 83 84 85 86
--------------------------------------------------------------------------------
RHP3
n missing distinct Info Mean Gmd .05 .10
10000 0 21 0.974 14.15 2.522 10 12
.25 .50 .75 .90 .95
13 14 15 17 18
lowest : 0 10 11 12 13, highest: 25 26 27 29 33
--------------------------------------------------------------------------------
RHP4
n missing distinct Info Mean Gmd .05 .10
10000 0 17 0.86 4.404 1.095 3 3
.25 .50 .75 .90 .95
4 4 5 5 6
lowest : 0 1 2 3 4, highest: 12 13 14 15 17
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 74 36 136 974 4762 3137 527 163 82 49 26
Proportion 0.007 0.004 0.014 0.097 0.476 0.314 0.053 0.016 0.008 0.005 0.003
Value 11 12 13 14 15 17
Frequency 13 8 8 1 3 1
Proportion 0.001 0.001 0.001 0.000 0.000 0.000
--------------------------------------------------------------------------------
HUPA1
n missing distinct Info Mean Gmd .05 .10
10000 0 90 0.991 10.12 12.36 0 0
.25 .50 .75 .90 .95
1 5 14 27 36
lowest : 0 1 2 3 4, highest: 86 87 89 93 95
--------------------------------------------------------------------------------
HUPA2
n missing distinct Info Mean Gmd .05 .10
10000 0 99 0.896 9.264 14.34 0 0
.25 .50 .75 .90 .95
0 1 10 31 48
lowest : 0 1 2 3 4, highest: 94 95 96 98 99
--------------------------------------------------------------------------------
HUPA3
n missing distinct Info Mean Gmd .05 .10
10000 0 97 0.869 8.517 12.72 0 0
.25 .50 .75 .90 .95
0 0 13 27 38
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HUPA4
n missing distinct Info Mean Gmd .05 .10
10000 0 67 0.998 11.15 8.039 2 3
.25 .50 .75 .90 .95
6 10 15 20 25
lowest : 0 1 2 3 4, highest: 82 85 86 88 99
--------------------------------------------------------------------------------
HUPA5
n missing distinct Info Mean Gmd .05 .10
10000 0 61 0.976 5.161 6.699 0 0
.25 .50 .75 .90 .95
0 2 7 14 20
lowest : 0 1 2 3 4, highest: 57 58 71 73 92
--------------------------------------------------------------------------------
HUPA6
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.943 10.87 15.87 0 0
.25 .50 .75 .90 .95
0 2 14 35 51
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HUPA7
n missing distinct Info Mean Gmd .05 .10
10000 0 37 0.796 1.623 2.471 0 0
.25 .50 .75 .90 .95
0 0 2 5 7
lowest : 0 1 2 3 4, highest: 42 46 50 63 71
--------------------------------------------------------------------------------
RP1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.995 29.97 35.23 0 0
.25 .50 .75 .90 .95
2 15 56 85 92
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
RP2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 42.94 39.88 0 1
.25 .50 .75 .90 .95
8 38 78 93 96
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
RP3
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 60.06 37.07 4 9
.25 .50 .75 .90 .95
30 69 91 97 98
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
RP4
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 76.84 25.18 24 38
.25 .50 .75 .90 .95
65 86 96 98 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
MSA
n missing distinct Info Mean Gmd .05 .10
9985 15 269 0.989 3569 3287 0 0
.25 .50 .75 .90 .95
520 3360 5960 7490 8280
lowest : 0 40 120 200 220, highest: 9200 9260 9270 9340 9360
--------------------------------------------------------------------------------
ADI
n missing distinct Info Mean Gmd .05 .10
9985 15 187 0.999 186.1 152.5 13 15
.25 .50 .75 .90 .95
65 175 279 373 431
lowest : 0 3 9 11 13, highest: 613 621 625 627 645
--------------------------------------------------------------------------------
DMA
n missing distinct Info Mean Gmd .05 .10
9985 15 188 0.999 664 129.6 515 524
.25 .50 .75 .90 .95
561 634 800 819 825
lowest : 0 501 503 504 505, highest: 855 862 866 868 881
--------------------------------------------------------------------------------
IC1
n missing distinct Info Mean Gmd .05 .10
10000 0 825 1 342.7 169.1 151 181
.25 .50 .75 .90 .95
233 313 420 540 628
lowest : 0 50 52 55 57, highest: 1351 1376 1388 1397 1500
--------------------------------------------------------------------------------
IC2
n missing distinct Info Mean Gmd .05 .10
10000 0 874 1 389.9 180.1 183 219
.25 .50 .75 .90 .95
275 360 469 600 692
lowest : 0 50 51 53 59, highest: 1386 1422 1453 1469 1500
--------------------------------------------------------------------------------
IC3
n missing distinct Info Mean Gmd .05 .10
10000 0 842 1 390.1 171.7 199 229
.25 .50 .75 .90 .95
280 358 472 599 700
lowest : 0 66 71 72 74, highest: 1187 1188 1193 1201 1394
--------------------------------------------------------------------------------
IC4
n missing distinct Info Mean Gmd .05 .10
10000 0 892 1 433.7 184 227 261
.25 .50 .75 .90 .95
316 402 518 658 765
lowest : 0 57 73 77 82, highest: 1242 1243 1259 1394 1500
--------------------------------------------------------------------------------
IC5
n missing distinct Info Mean Gmd .05 .10
10000 0 7411 1 15858 8028 7507 8755
.25 .50 .75 .90 .95
10831 13912 18450 24831 30452
lowest : 0 1972 2909 2970 3030, highest: 100528 112390 119336 119454 159306
--------------------------------------------------------------------------------
IC6
n missing distinct Info Mean Gmd .05 .10
10000 0 91 0.999 21.33 16.08 3 5
.25 .50 .75 .90 .95
10 19 30 41 48
lowest : 0 1 2 3 4, highest: 89 90 94 98 99
--------------------------------------------------------------------------------
IC7
n missing distinct Info Mean Gmd .05 .10
10000 0 55 0.998 16.79 8.87 4 6
.25 .50 .75 .90 .95
11 17 22 26 29
lowest : 0 1 2 3 4, highest: 54 55 57 58 99
--------------------------------------------------------------------------------
IC8
n missing distinct Info Mean Gmd .05 .10
10000 0 47 0.997 15.71 6.821 5 8
.25 .50 .75 .90 .95
12 16 19 23 26
lowest : 0 1 2 3 4, highest: 43 44 45 61 99
--------------------------------------------------------------------------------
IC9
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.998 18.44 8.237 6 9
.25 .50 .75 .90 .95
14 18 23 28 31
lowest : 0 1 2 3 4, highest: 50 52 53 67 99
--------------------------------------------------------------------------------
IC10
n missing distinct Info Mean Gmd .05 .10
10000 0 56 0.999 16.14 10.84 3 5
.25 .50 .75 .90 .95
9 15 23 30 33
lowest : 0 1 2 3 4, highest: 51 52 53 54 99
--------------------------------------------------------------------------------
IC11
n missing distinct Info Mean Gmd .05 .10
10000 0 40 0.991 5.673 6.061 0 0
.25 .50 .75 .90 .95
1 4 8 14 18
lowest : 0 1 2 3 4, highest: 35 36 37 39 47
--------------------------------------------------------------------------------
IC12
n missing distinct Info Mean Gmd .05 .10
10000 0 32 0.926 2.269 3.083 0 0
.25 .50 .75 .90 .95
0 1 3 7 9
lowest : 0 1 2 3 4, highest: 28 31 32 33 50
--------------------------------------------------------------------------------
IC13
n missing distinct Info Mean Gmd .05 .10
10000 0 21 0.731 0.9277 1.464 0 0
.25 .50 .75 .90 .95
0 0 1 3 4
lowest : 0 1 2 3 4, highest: 16 17 18 20 28
--------------------------------------------------------------------------------
IC14
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.835 1.897 3.014 0 0
.25 .50 .75 .90 .95
0 0 2 5 9
lowest : 0 1 2 3 4, highest: 54 58 59 60 70
--------------------------------------------------------------------------------
IC15
n missing distinct Info Mean Gmd .05 .10
10000 0 87 0.999 14.73 13.28 0 2
.25 .50 .75 .90 .95
5 12 21 31 39
lowest : 0 1 2 3 4, highest: 86 87 88 92 99
--------------------------------------------------------------------------------
IC16
n missing distinct Info Mean Gmd .05 .10
10000 0 61 0.999 15.79 10.22 2 4
.25 .50 .75 .90 .95
9 16 22 27 31
lowest : 0 1 2 3 4, highest: 58 60 67 68 99
--------------------------------------------------------------------------------
IC17
n missing distinct Info Mean Gmd .05 .10
10000 0 55 0.998 16.31 8.315 4 6
.25 .50 .75 .90 .95
11 17 21 25 28
lowest : 0 1 2 3 4, highest: 52 53 59 62 99
--------------------------------------------------------------------------------
IC18
n missing distinct Info Mean Gmd .05 .10
10000 0 60 0.999 20.71 9.33 7 10
.25 .50 .75 .90 .95
15 21 26 31 34
lowest : 0 1 2 3 4, highest: 56 57 62 67 99
--------------------------------------------------------------------------------
IC19
n missing distinct Info Mean Gmd .05 .10
10000 0 60 0.999 18.86 11.96 3 6
.25 .50 .75 .90 .95
11 18 26 33 37
lowest : 0 1 2 3 4, highest: 55 56 58 59 68
--------------------------------------------------------------------------------
IC20
n missing distinct Info Mean Gmd .05 .10
10000 0 45 0.992 6.697 7.096 0 0
.25 .50 .75 .90 .95
2 4 10 17 21
lowest : 0 1 2 3 4, highest: 40 41 43 49 64
--------------------------------------------------------------------------------
IC21
n missing distinct Info Mean Gmd .05 .10
10000 0 34 0.923 2.643 3.627 0 0
.25 .50 .75 .90 .95
0 1 4 8 11
lowest : 0 1 2 3 4, highest: 29 30 32 40 50
--------------------------------------------------------------------------------
IC22
n missing distinct Info Mean Gmd .05 .10
10000 0 23 0.735 1.124 1.779 0 0
.25 .50 .75 .90 .95
0 0 1 4 6
lowest : 0 1 2 3 4, highest: 18 19 20 24 30
--------------------------------------------------------------------------------
IC23
n missing distinct Info Mean Gmd .05 .10
10000 0 62 0.831 2.318 3.727 0 0
.25 .50 .75 .90 .95
0 0 2 6 11
lowest : 0 1 2 3 4, highest: 63 65 66 70 99
--------------------------------------------------------------------------------
HHAS1
n missing distinct Info Mean Gmd .05 .10
10000 0 94 0.999 26.96 15.21 7 10
.25 .50 .75 .90 .95
17 26 35 43 51
lowest : 0 1 2 3 4, highest: 91 92 93 95 99
--------------------------------------------------------------------------------
HHAS2
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.993 6.069 5.979 0 0
.25 .50 .75 .90 .95
2 4 8 13 18
lowest : 0 1 2 3 4, highest: 53 54 58 62 80
--------------------------------------------------------------------------------
HHAS3
n missing distinct Info Mean Gmd .05 .10
10000 0 95 1 43.21 19.35 16 22
.25 .50 .75 .90 .95
31 43 55 66 72
lowest : 0 1 2 3 4, highest: 91 92 93 94 99
--------------------------------------------------------------------------------
HHAS4
n missing distinct Info Mean Gmd .05 .10
10000 0 72 0.998 10.57 10.02 1 1
.25 .50 .75 .90 .95
4 8 14 23 30
lowest : 0 1 2 3 4, highest: 72 76 77 80 91
--------------------------------------------------------------------------------
MC1
n missing distinct Info Mean Gmd .05 .10
10000 0 94 1 48.15 17.63 26 30
.25 .50 .75 .90 .95
37 46 58 69 77
lowest : 0 6 8 9 10, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
MC2
n missing distinct Info Mean Gmd .05 .10
10000 0 95 1 51.08 17.68 22 30
.25 .50 .75 .90 .95
42 53 62 70 73
lowest : 0 1 2 3 4, highest: 90 91 92 94 99
--------------------------------------------------------------------------------
MC3
n missing distinct Info Mean Gmd .05 .10
10000 0 75 0.998 12.23 10.42 1 2
.25 .50 .75 .90 .95
5 9 16 25 32
lowest : 0 1 2 3 4, highest: 79 81 82 88 96
--------------------------------------------------------------------------------
TPE1
n missing distinct Info Mean Gmd .05 .10
10000 0 90 0.999 76.09 12.93 53 63
.25 .50 .75 .90 .95
72 79 84 88 91
lowest : 0 3 6 8 9, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
TPE2
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.997 13.02 7.228 4 5
.25 .50 .75 .90 .95
9 12 17 21 25
lowest : 0 1 2 3 4, highest: 49 52 58 61 66
--------------------------------------------------------------------------------
TPE3
n missing distinct Info Mean Gmd .05 .10
10000 0 57 0.851 2.51 4.017 0 0
.25 .50 .75 .90 .95
0 0 2 7 13
lowest : 0 1 2 3 4, highest: 54 55 61 62 71
--------------------------------------------------------------------------------
TPE4
n missing distinct Info Mean Gmd .05 .10
10000 0 52 0.796 1.945 3.191 0 0
.25 .50 .75 .90 .95
0 0 2 5 10
lowest : 0 1 2 3 4, highest: 48 52 55 61 71
--------------------------------------------------------------------------------
TPE5
n missing distinct Info Mean Gmd .05 .10
10000 0 31 0.237 0.4527 0.8653 0 0
.25 .50 .75 .90 .95
0 0 0 0 3
lowest : 0 1 2 3 4, highest: 26 32 33 35 38
--------------------------------------------------------------------------------
TPE6
n missing distinct Info Mean Gmd .05 .10
10000 0 15 0.15 0.1017 0.1967 0 0
.25 .50 .75 .90 .95
0 0 0 0 1
lowest : 0 1 2 3 4, highest: 11 12 13 15 17
Value 0 1 2 3 4 5 6 7 8 9 11
Frequency 9473 321 115 39 20 8 7 3 2 3 1
Proportion 0.947 0.032 0.012 0.004 0.002 0.001 0.001 0.000 0.000 0.000 0.000
Value 12 13 15 17
Frequency 2 3 1 2
Proportion 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
TPE7
n missing distinct Info Mean Gmd .05 .10
10000 0 10 0.402 0.2358 0.4162 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 5 6 7 9 12
Value 0 1 2 3 4 5 6 7 9 12
Frequency 8418 1098 316 99 43 13 4 7 1 1
Proportion 0.842 0.110 0.032 0.010 0.004 0.001 0.000 0.001 0.000 0.000
--------------------------------------------------------------------------------
TPE8
n missing distinct Info Mean Gmd .05 .10
10000 0 67 0.983 4.041 4.519 0 0
.25 .50 .75 .90 .95
1 3 5 9 12
lowest : 0 1 2 3 4, highest: 67 69 71 91 99
--------------------------------------------------------------------------------
TPE9
n missing distinct Info Mean Gmd .05 .10
10000 0 37 0.98 3.304 3.34 0 0
.25 .50 .75 .90 .95
1 2 4 7 9
lowest : 0 1 2 3 4, highest: 35 39 43 83 99
--------------------------------------------------------------------------------
PEC1
n missing distinct Info Mean Gmd .05 .10
10000 0 69 0.883 2.14 3.366 0 0
.25 .50 .75 .90 .95
0 1 2 4 9
lowest : 0 1 2 3 4, highest: 66 67 70 72 79
--------------------------------------------------------------------------------
PEC2
n missing distinct Info Mean Gmd .05 .10
10000 0 96 0.999 18.38 19.73 0 1
.25 .50 .75 .90 .95
4 11 27 47 60
lowest : 0 1 2 3 4, highest: 92 93 94 96 99
--------------------------------------------------------------------------------
TPE10
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.997 19.52 6.958 10 12
.25 .50 .75 .90 .95
15 19 23 28 31
lowest : 0 2 3 4 5, highest: 51 52 53 59 90
--------------------------------------------------------------------------------
TPE11
n missing distinct Info Mean Gmd .05 .10
10000 0 52 0.997 23.81 7.124 14 16
.25 .50 .75 .90 .95
20 23 28 32 35
lowest : 0 3 5 6 7, highest: 50 51 52 54 60
--------------------------------------------------------------------------------
TPE12
n missing distinct Info Mean Gmd .05 .10
10000 0 44 0.991 5.296 5.086 0 0
.25 .50 .75 .90 .95
2 4 7 12 15
lowest : 0 1 2 3 4, highest: 39 44 47 48 51
--------------------------------------------------------------------------------
TPE13
n missing distinct Info Mean Gmd .05 .10
10000 0 93 1 60.45 18.95 27 36
.25 .50 .75 .90 .95
51 64 73 79 83
lowest : 0 5 7 8 9, highest: 93 94 95 96 99
--------------------------------------------------------------------------------
LFC1
n missing distinct Info Mean Gmd .05 .10
10000 0 96 0.999 64.57 14.07 41.95 50.00
.25 .50 .75 .90 .95
59.00 66.00 73.00 79.00 82.00
lowest : 0 2 4 5 6, highest: 93 94 95 96 99
--------------------------------------------------------------------------------
LFC2
n missing distinct Info Mean Gmd .05 .10
10000 0 97 0.999 73.89 14.6 49 58
.25 .50 .75 .90 .95
69 76 83 88 91
lowest : 0 3 4 5 6, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LFC3
n missing distinct Info Mean Gmd .05 .10
10000 0 92 0.999 56.03 14.85 33 40
.25 .50 .75 .90 .95
49 57 65 72 76
lowest : 0 4 5 6 7, highest: 90 92 93 95 99
--------------------------------------------------------------------------------
LFC4
n missing distinct Info Mean Gmd .05 .10
10000 0 98 0.999 69.89 15.38 43 53
.25 .50 .75 .90 .95
64 72 79 85 88
lowest : 0 2 3 4 5, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LFC5
n missing distinct Info Mean Gmd .05 .10
10000 0 93 0.999 53.02 15.06 29.95 37.00
.25 .50 .75 .90 .95
46.00 54.00 62.00 69.00 73.00
lowest : 0 2 4 5 6, highest: 90 91 92 94 99
--------------------------------------------------------------------------------
LFC6
n missing distinct Info Mean Gmd .05 .10
10000 0 91 0.999 64.19 16.99 37 46
.25 .50 .75 .90 .95
57 66 74 81 86
lowest : 0 8 11 12 13, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LFC7
n missing distinct Info Mean Gmd .05 .10
10000 0 98 1 48.88 18.79 17 27
.25 .50 .75 .90 .95
40 50 60 68 74
lowest : 0 1 2 3 4, highest: 93 94 95 96 99
--------------------------------------------------------------------------------
LFC8
n missing distinct Info Mean Gmd .05 .10
10000 0 98 0.971 70.13 33 0 11
.25 .50 .75 .90 .95
56 78 99 99 99
lowest : 0 3 4 5 6, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LFC9
n missing distinct Info Mean Gmd .05 .10
10000 0 91 0.848 62.85 45.32 0 0
.25 .50 .75 .90 .95
0 97 99 99 99
lowest : 0 4 6 11 13, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LFC10
n missing distinct Info Mean Gmd .05 .10
10000 0 76 0.972 6.672 8.392 0 0
.25 .50 .75 .90 .95
0 4 9 17 24
lowest : 0 1 2 3 4, highest: 80 82 90 96 99
--------------------------------------------------------------------------------
OCC1
n missing distinct Info Mean Gmd .05 .10
10000 0 61 0.998 14.3 9.052 3 5
.25 .50 .75 .90 .95
9 13 19 26 30
lowest : 0 1 2 3 4, highest: 57 58 62 78 99
--------------------------------------------------------------------------------
OCC2
n missing distinct Info Mean Gmd .05 .10
10000 0 50 0.998 12.69 7.753 3 5
.25 .50 .75 .90 .95
8 11 17 22 26
lowest : 0 1 2 3 4, highest: 46 50 51 59 60
--------------------------------------------------------------------------------
OCC3
n missing distinct Info Mean Gmd .05 .10
10000 0 25 0.98 3.582 2.703 0 0
.25 .50 .75 .90 .95
2 3 5 7 8
lowest : 0 1 2 3 4, highest: 24 30 33 34 99
--------------------------------------------------------------------------------
OCC4
n missing distinct Info Mean Gmd .05 .10
10000 0 48 0.996 12.44 6.039 4 6
.25 .50 .75 .90 .95
9 12 15 19 22
lowest : 0 1 2 3 4, highest: 56 60 62 66 99
--------------------------------------------------------------------------------
OCC5
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.996 15.59 6.23 7 9
.25 .50 .75 .90 .95
12 15 19 23 25
lowest : 0 1 2 3 4, highest: 58 62 64 74 99
--------------------------------------------------------------------------------
OCC6
n missing distinct Info Mean Gmd .05 .10
10000 0 14 0.603 0.4315 0.7004 0 0
.25 .50 .75 .90 .95
0 0 1 1 2
lowest : 0 1 2 3 4, highest: 9 10 12 15 20
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 7309 1837 508 177 73 36 21 18 11 2 3
Proportion 0.731 0.184 0.051 0.018 0.007 0.004 0.002 0.002 0.001 0.000 0.000
Value 12 15 20
Frequency 3 1 1
Proportion 0.000 0.000 0.000
--------------------------------------------------------------------------------
OCC7
n missing distinct Info Mean Gmd .05 .10
10000 0 22 0.945 1.666 1.782 0 0
.25 .50 .75 .90 .95
0 1 2 4 5
lowest : 0 1 2 3 4, highest: 18 21 24 31 38
--------------------------------------------------------------------------------
OCC8
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.996 10.52 6.419 3 4
.25 .50 .75 .90 .95
7 10 13 18 22
lowest : 0 1 2 3 4, highest: 51 52 54 59 99
--------------------------------------------------------------------------------
OCC9
n missing distinct Info Mean Gmd .05 .10
10000 0 55 0.941 2.781 3.877 0 0
.25 .50 .75 .90 .95
0 1 3 7 11
lowest : 0 1 2 3 4, highest: 60 65 69 70 77
--------------------------------------------------------------------------------
OCC10
n missing distinct Info Mean Gmd .05 .10
10000 0 42 0.997 11.33 6.492 2 4
.25 .50 .75 .90 .95
7 11 15 19 21
lowest : 0 1 2 3 4, highest: 37 38 39 41 46
--------------------------------------------------------------------------------
OCC11
n missing distinct Info Mean Gmd .05 .10
10000 0 42 0.994 6.293 6.029 0 0
.25 .50 .75 .90 .95
2 5 9 14 18
lowest : 0 1 2 3 4, highest: 40 41 42 60 99
--------------------------------------------------------------------------------
OCC12
n missing distinct Info Mean Gmd .05 .10
10000 0 29 0.988 3.947 3.347 0 0
.25 .50 .75 .90 .95
2 3 6 8 9
lowest : 0 1 2 3 4, highest: 24 25 26 27 34
--------------------------------------------------------------------------------
OCC13
n missing distinct Info Mean Gmd .05 .10
10000 0 28 0.985 3.674 3.063 0 0
.25 .50 .75 .90 .95
2 3 5 7 9
lowest : 0 1 2 3 4, highest: 24 29 31 32 38
--------------------------------------------------------------------------------
EIC1
n missing distinct Info Mean Gmd .05 .10
10000 0 60 0.955 3.083 4.208 0 0
.25 .50 .75 .90 .95
0 1 3 8 13
lowest : 0 1 2 3 4, highest: 66 72 73 81 83
--------------------------------------------------------------------------------
EIC2
n missing distinct Info Mean Gmd .05 .10
10000 0 34 0.525 0.6919 1.235 0 0
.25 .50 .75 .90 .95
0 0 0 2 4
lowest : 0 1 2 3 4, highest: 31 33 34 36 49
--------------------------------------------------------------------------------
EIC3
n missing distinct Info Mean Gmd .05 .10
10000 0 36 0.992 6.402 4.3 0 2
.25 .50 .75 .90 .95
4 6 9 11 13
lowest : 0 1 2 3 4, highest: 32 34 35 36 99
--------------------------------------------------------------------------------
EIC4
n missing distinct Info Mean Gmd .05 .10
10000 0 62 0.999 16.98 11.11 3 6
.25 .50 .75 .90 .95
9 15 23 31 36
lowest : 0 1 2 3 4, highest: 59 60 63 71 99
--------------------------------------------------------------------------------
EIC5
n missing distinct Info Mean Gmd .05 .10
10000 0 32 0.985 4.317 3.301 0 1
.25 .50 .75 .90 .95
2 4 6 8 10
lowest : 0 1 2 3 4, highest: 28 29 30 32 34
--------------------------------------------------------------------------------
EIC6
n missing distinct Info Mean Gmd .05 .10
10000 0 21 0.974 2.696 2.33 0 0
.25 .50 .75 .90 .95
1 2 4 5 7
lowest : 0 1 2 3 4, highest: 17 18 21 22 24
--------------------------------------------------------------------------------
EIC7
n missing distinct Info Mean Gmd .05 .10
10000 0 27 0.987 4.351 3.221 0 1
.25 .50 .75 .90 .95
2 4 6 8 10
lowest : 0 1 2 3 4, highest: 22 23 24 27 31
--------------------------------------------------------------------------------
EIC8
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.997 16.84 6.632 8 10
.25 .50 .75 .90 .95
13 16 20 24 27
lowest : 0 1 2 3 4, highest: 50 54 60 66 99
--------------------------------------------------------------------------------
EIC9
n missing distinct Info Mean Gmd .05 .10
10000 0 40 0.994 6.939 5.003 1 2
.25 .50 .75 .90 .95
4 6 9 13 15
lowest : 0 1 2 3 4, highest: 35 37 41 56 99
--------------------------------------------------------------------------------
EIC10
n missing distinct Info Mean Gmd .05 .10
10000 0 30 0.986 4.721 3.187 0 1
.25 .50 .75 .90 .95
3 4 6 8 10
lowest : 0 1 2 3 4, highest: 25 26 27 29 31
--------------------------------------------------------------------------------
EIC11
n missing distinct Info Mean Gmd .05 .10
10000 0 36 0.976 3.284 2.948 0 0
.25 .50 .75 .90 .95
2 3 4 7 9
lowest : 0 1 2 3 4, highest: 35 36 37 44 99
--------------------------------------------------------------------------------
EIC12
n missing distinct Info Mean Gmd .05 .10
10000 0 26 0.929 1.538 1.84 0 0
.25 .50 .75 .90 .95
0 1 2 4 5
lowest : 0 1 2 3 4, highest: 23 24 25 27 28
--------------------------------------------------------------------------------
EIC13
n missing distinct Info Mean Gmd .05 .10
10000 0 40 0.993 8.045 4.677 2 3
.25 .50 .75 .90 .95
5 7 10 13 16
lowest : 0 1 2 3 4, highest: 36 42 51 60 99
--------------------------------------------------------------------------------
EIC14
n missing distinct Info Mean Gmd .05 .10
10000 0 52 0.994 8.02 5.277 1 3
.25 .50 .75 .90 .95
5 7 10 14 17
lowest : 0 1 2 3 4, highest: 54 56 57 58 66
--------------------------------------------------------------------------------
EIC15
n missing distinct Info Mean Gmd .05 .10
10000 0 47 0.992 6.639 4.562 1 2
.25 .50 .75 .90 .95
4 6 8 12 15
lowest : 0 1 2 3 4, highest: 59 61 65 76 78
--------------------------------------------------------------------------------
EIC16
n missing distinct Info Mean Gmd .05 .10
10000 0 41 0.988 4.656 4.05 0 1
.25 .50 .75 .90 .95
2 4 6 9 12
lowest : 0 1 2 3 4, highest: 36 38 40 47 65
--------------------------------------------------------------------------------
OEDC1
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.992 7.03 4.438 1 2
.25 .50 .75 .90 .95
4 6 9 12 15
lowest : 0 1 2 3 4, highest: 33 34 43 60 65
--------------------------------------------------------------------------------
OEDC2
n missing distinct Info Mean Gmd .05 .10
10000 0 46 0.988 4.661 4.688 0 0
.25 .50 .75 .90 .95
2 3 6 10 14
lowest : 0 1 2 3 4, highest: 41 46 47 49 51
--------------------------------------------------------------------------------
OEDC3
n missing distinct Info Mean Gmd .05 .10
10000 0 49 0.977 3.241 3.556 0 0
.25 .50 .75 .90 .95
1 2 4 7 10
lowest : 0 1 2 3 4, highest: 46 49 51 67 99
--------------------------------------------------------------------------------
OEDC4
n missing distinct Info Mean Gmd .05 .10
10000 0 51 0.995 8.056 5.7 1 2
.25 .50 .75 .90 .95
5 7 10 15 18
lowest : 0 1 2 3 4, highest: 56 61 65 84 99
--------------------------------------------------------------------------------
OEDC5
n missing distinct Info Mean Gmd .05 .10
10000 0 79 0.999 69.59 12.37 50 56
.25 .50 .75 .90 .95
64 71 77 82 85
lowest : 0 13 19 23 24, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
OEDC6
n missing distinct Info Mean Gmd .05 .10
10000 0 47 0.991 6.142 4.364 1 2
.25 .50 .75 .90 .95
3 5 8 11 14
lowest : 0 1 2 3 4, highest: 48 49 52 81 99
--------------------------------------------------------------------------------
OEDC7
n missing distinct Info Mean Gmd .05 .10
10000 0 13 0.687 0.5007 0.7592 0 0
.25 .50 .75 .90 .95
0 0 1 2 2
lowest : 0 1 2 3 4, highest: 8 9 11 12 13
Value 0 1 2 3 4 5 6 7 8 9 11
Frequency 6708 2238 683 216 90 37 12 9 1 2 1
Proportion 0.671 0.224 0.068 0.022 0.009 0.004 0.001 0.001 0.000 0.000 0.000
Value 12 13
Frequency 1 2
Proportion 0.000 0.000
--------------------------------------------------------------------------------
EC1
n missing distinct Info Mean Gmd .05 .10
10000 0 76 0.882 128.4 15.56 120 120
.25 .50 .75 .90 .95
120 120 140 152 160
lowest : 0 89 90 91 92, highest: 159 160 161 162 170
--------------------------------------------------------------------------------
EC2
n missing distinct Info Mean Gmd .05 .10
10000 0 66 0.997 8.739 8.421 0 1
.25 .50 .75 .90 .95
3 6 12 19 24
lowest : 0 1 2 3 4, highest: 71 72 73 82 99
--------------------------------------------------------------------------------
EC3
n missing distinct Info Mean Gmd .05 .10
10000 0 49 0.998 12.72 8.396 2 4
.25 .50 .75 .90 .95
7 12 18 23 26
lowest : 0 1 2 3 4, highest: 45 47 53 55 67
--------------------------------------------------------------------------------
EC4
n missing distinct Info Mean Gmd .05 .10
10000 0 67 0.999 28.74 11.6 11 15
.25 .50 .75 .90 .95
22 29 36 42 45
lowest : 0 1 2 3 4, highest: 62 63 64 65 99
--------------------------------------------------------------------------------
EC5
n missing distinct Info Mean Gmd .05 .10
10000 0 52 0.998 20.75 7.817 10 12
.25 .50 .75 .90 .95
16 21 25 30 32
lowest : 0 1 2 3 4, highest: 47 48 53 54 56
--------------------------------------------------------------------------------
EC6
n missing distinct Info Mean Gmd .05 .10
10000 0 28 0.991 6.585 3.771 2 2
.25 .50 .75 .90 .95
4 6 9 11 13
lowest : 0 1 2 3 4, highest: 24 25 26 31 33
--------------------------------------------------------------------------------
EC7
n missing distinct Info Mean Gmd .05 .10
10000 0 56 0.999 14.28 10.85 2 4
.25 .50 .75 .90 .95
7 12 20 29 34
lowest : 0 1 2 3 4, highest: 51 52 53 57 65
--------------------------------------------------------------------------------
EC8
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.995 7.414 7.063 0 1
.25 .50 .75 .90 .95
3 5 10 17 22
lowest : 0 1 2 3 4, highest: 50 51 54 58 59
--------------------------------------------------------------------------------
SEC1
n missing distinct Info Mean Gmd .05 .10
10000 0 45 0.983 3.744 3.332 0.00 1.00
.25 .50 .75 .90 .95
1.00 3.00 5.00 7.00 9.05
lowest : 0 1 2 3 4, highest: 52 55 57 71 90
--------------------------------------------------------------------------------
SEC2
n missing distinct Info Mean Gmd .05 .10
10000 0 70 0.998 21.94 8.187 8 12
.25 .50 .75 .90 .95
18 22 26 30 33
lowest : 0 1 2 3 4, highest: 75 76 80 90 99
--------------------------------------------------------------------------------
SEC3
n missing distinct Info Mean Gmd .05 .10
10000 0 12 0.941 1.896 1.439 0 0
.25 .50 .75 .90 .95
1 2 3 4 4
lowest : 0 1 2 3 4, highest: 7 8 9 10 11
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 1323 2909 3008 1659 673 286 93 31 9 6 2
Proportion 0.132 0.291 0.301 0.166 0.067 0.029 0.009 0.003 0.001 0.001 0.000
Value 11
Frequency 1
Proportion 0.000
--------------------------------------------------------------------------------
SEC4
n missing distinct Info Mean Gmd .05 .10
10000 0 45 0.997 17.06 7.005 5 9
.25 .50 .75 .90 .95
13 18 21 24 26
lowest : 0 1 2 3 4, highest: 40 41 42 44 48
--------------------------------------------------------------------------------
SEC5
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.991 6.731 4.518 2 2
.25 .50 .75 .90 .95
4 6 8 11 14
lowest : 0 1 2 3 4, highest: 75 85 86 89 99
--------------------------------------------------------------------------------
AFC1
n missing distinct Info Mean Gmd .05 .10
10000 0 45 0.439 0.5779 1.072 0 0
.25 .50 .75 .90 .95
0 0 0 1 2
lowest : 0 1 2 3 4, highest: 69 72 74 83 86
--------------------------------------------------------------------------------
AFC2
n missing distinct Info Mean Gmd .05 .10
10000 0 62 0.536 1.028 1.881 0 0
.25 .50 .75 .90 .95
0 0 0 2 4
lowest : 0 1 2 3 4, highest: 90 93 94 95 97
--------------------------------------------------------------------------------
AFC3
n missing distinct Info Mean Gmd .05 .10
10000 0 20 0.164 0.145 0.2812 0 0
.25 .50 .75 .90 .95
0 0 0 0 1
lowest : 0 1 2 3 4, highest: 18 20 29 53 60
Value 0 1 2 3 4 5 6 7 8 9 13
Frequency 9422 312 118 57 32 17 13 4 8 3 1
Proportion 0.942 0.031 0.012 0.006 0.003 0.002 0.001 0.000 0.001 0.000 0.000
Value 14 15 16 17 18 20 29 53 60
Frequency 1 2 2 2 2 1 1 1 1
Proportion 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
AFC4
n missing distinct Info Mean Gmd .05 .10
10000 0 46 0.996 15.59 5.74 7 9
.25 .50 .75 .90 .95
12 15 19 22 24
lowest : 0 1 2 3 4, highest: 45 46 50 55 63
--------------------------------------------------------------------------------
AFC5
n missing distinct Info Mean Gmd .05 .10
10000 0 79 0.999 31.36 11.54 15 19
.25 .50 .75 .90 .95
25 31 37 44 49
lowest : 0 1 2 3 4, highest: 79 82 83 84 85
--------------------------------------------------------------------------------
AFC6
n missing distinct Info Mean Gmd .05 .10
10000 0 21 0.909 1.244 1.504 0 0
.25 .50 .75 .90 .95
0 1 2 3 4
lowest : 0 1 2 3 4, highest: 16 20 25 35 50
--------------------------------------------------------------------------------
VC1
n missing distinct Info Mean Gmd .05 .10
10000 0 91 0.999 30.54 16.01 7 13
.25 .50 .75 .90 .95
21 30 39 49 55
lowest : 0 1 2 3 4, highest: 87 88 89 91 99
--------------------------------------------------------------------------------
VC2
n missing distinct Info Mean Gmd .05 .10
10000 0 72 0.998 17.88 10.5 0 6
.25 .50 .75 .90 .95
12 17 23 30 34
lowest : 0 1 2 3 4, highest: 76 80 81 88 99
--------------------------------------------------------------------------------
VC3
n missing distinct Info Mean Gmd .05 .10
10000 0 99 1 33.78 19.04 8 13
.25 .50 .75 .90 .95
22 32 44 56 64
lowest : 0 1 2 3 4, highest: 94 95 96 98 99
--------------------------------------------------------------------------------
VC4
n missing distinct Info Mean Gmd .05 .10
10000 0 79 0.994 11.16 10.69 0 0
.25 .50 .75 .90 .95
4 9 15 24 31
lowest : 0 1 2 3 4, highest: 85 86 89 93 99
--------------------------------------------------------------------------------
ANC1
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.768 0.731 1.093 0 0
.25 .50 .75 .90 .95
0 0 1 2 2
lowest : 0 1 2 3 4, highest: 44 45 48 50 70
--------------------------------------------------------------------------------
ANC2
n missing distinct Info Mean Gmd .05 .10
10000 0 41 0.993 5.519 4.547 0 1
.25 .50 .75 .90 .95
2 5 8 11 13
lowest : 0 1 2 3 4, highest: 39 40 43 45 47
--------------------------------------------------------------------------------
ANC3
n missing distinct Info Mean Gmd .05 .10
10000 0 20 0.85 0.8837 1.092 0 0
.25 .50 .75 .90 .95
0 1 1 2 3
lowest : 0 1 2 3 4, highest: 15 16 17 25 32
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 4471 3898 1034 301 88 61 42 22 24 14 13
Proportion 0.447 0.390 0.103 0.030 0.009 0.006 0.004 0.002 0.002 0.001 0.001
Value 11 12 13 14 15 16 17 25 32
Frequency 6 9 3 3 2 4 3 1 1
Proportion 0.001 0.001 0.000 0.000 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
ANC4
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.997 10.05 8.02 1 2
.25 .50 .75 .90 .95
5 8 13 19 26
lowest : 0 1 2 3 4, highest: 65 67 70 73 99
--------------------------------------------------------------------------------
ANC5
n missing distinct Info Mean Gmd .05 .10
10000 0 13 0.337 0.2016 0.3669 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 8 9 10 12 32
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 8715 920 210 68 44 16 14 4 4 2 1
Proportion 0.872 0.092 0.021 0.007 0.004 0.002 0.001 0.000 0.000 0.000 0.000
Value 12 32
Frequency 1 1
Proportion 0.000 0.000
--------------------------------------------------------------------------------
ANC6
n missing distinct Info Mean Gmd .05 .10
10000 0 11 0.359 0.1767 0.3136 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 6 7 8 9 11
Value 0 1 2 3 4 5 6 7 8 9 11
Frequency 8614 1133 181 47 12 6 3 1 1 1 1
Proportion 0.861 0.113 0.018 0.005 0.001 0.001 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
ANC7
n missing distinct Info Mean Gmd .05 .10
10000 0 32 0.989 4.854 3.539 0 1
.25 .50 .75 .90 .95
3 4 7 9 11
lowest : 0 1 2 3 4, highest: 28 31 33 42 44
--------------------------------------------------------------------------------
ANC8
n missing distinct Info Mean Gmd .05 .10
10000 0 27 0.937 1.815 2.241 0 0
.25 .50 .75 .90 .95
0 1 2 5 6
lowest : 0 1 2 3 4, highest: 23 25 27 31 41
--------------------------------------------------------------------------------
ANC9
n missing distinct Info Mean Gmd .05 .10
10000 0 31 0.704 0.7559 1.223 0 0
.25 .50 .75 .90 .95
0 0 1 2 3
lowest : 0 1 2 3 4, highest: 28 31 37 44 50
--------------------------------------------------------------------------------
ANC10
n missing distinct Info Mean Gmd .05 .10
10000 0 40 0.858 1.319 1.945 0 0
.25 .50 .75 .90 .95
0 1 1 3 5
lowest : 0 1 2 3 4, highest: 43 49 55 56 57
--------------------------------------------------------------------------------
ANC11
n missing distinct Info Mean Gmd .05 .10
10000 0 18 0.241 0.1503 0.2835 0 0
.25 .50 .75 .90 .95
0 0 0 0 1
lowest : 0 1 2 3 4, highest: 14 15 17 25 28
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 9120 639 115 54 25 16 13 5 3 1 1
Proportion 0.912 0.064 0.012 0.005 0.002 0.002 0.001 0.000 0.000 0.000 0.000
Value 12 13 14 15 17 25 28
Frequency 2 1 1 1 1 1 1
Proportion 0.000 0.000 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
ANC12
n missing distinct Info Mean Gmd .05 .10
10000 0 23 0.457 0.4283 0.7708 0 0
.25 .50 .75 .90 .95
0 0 0 1 2
lowest : 0 1 2 3 4, highest: 18 19 20 22 23
--------------------------------------------------------------------------------
ANC13
n missing distinct Info Mean Gmd .05 .10
10000 0 12 0.835 0.735 0.8933 0 0
.25 .50 .75 .90 .95
0 1 1 2 2
lowest : 0 1 2 3 4, highest: 7 8 9 10 18
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 4856 3679 1015 299 80 43 13 6 4 1 3
Proportion 0.486 0.368 0.102 0.030 0.008 0.004 0.001 0.001 0.000 0.000 0.000
Value 18
Frequency 1
Proportion 0.000
--------------------------------------------------------------------------------
ANC14
n missing distinct Info Mean Gmd .05 .10
10000 0 21 0.794 0.7356 1.042 0 0
.25 .50 .75 .90 .95
0 0 1 2 3
lowest : 0 1 2 3 4, highest: 16 19 20 21 26
--------------------------------------------------------------------------------
ANC15
n missing distinct Info Mean Gmd
10000 0 8 0.171 0.0759 0.1443
lowest : 0 1 2 3 4, highest: 3 4 5 6 14
Value 0 1 2 3 4 5 6 14
Frequency 9393 512 71 16 3 1 2 2
Proportion 0.939 0.051 0.007 0.002 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
POBC1
n missing distinct Info Mean Gmd .05 .10
10000 0 85 0.988 6.847 8.709 0 0
.25 .50 .75 .90 .95
1 3 8 18 27
lowest : 0 1 2 3 4, highest: 82 83 84 85 86
--------------------------------------------------------------------------------
POBC2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 56.85 24.64 17 26
.25 .50 .75 .90 .95
42 59 75 84 87
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LSC1
n missing distinct Info Mean Gmd .05 .10
10000 0 99 0.995 87.78 14.04 53 71
.25 .50 .75 .90 .95
86 94 97 98 99
lowest : 0 2 3 4 5, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LSC2
n missing distinct Info Mean Gmd .05 .10
10000 0 98 0.973 5.924 8.57 0 0
.25 .50 .75 .90 .95
1 2 5 14 28
lowest : 0 1 2 3 4, highest: 94 95 96 98 99
--------------------------------------------------------------------------------
LSC3
n missing distinct Info Mean Gmd .05 .10
10000 0 56 0.782 1.966 3.265 0 0
.25 .50 .75 .90 .95
0 0 2 5 10
lowest : 0 1 2 3 4, highest: 57 58 59 75 78
--------------------------------------------------------------------------------
LSC4
n missing distinct Info Mean Gmd .05 .10
10000 0 52 0.978 3.533 3.819 0 0
.25 .50 .75 .90 .95
1 2 4 8 11
lowest : 0 1 2 3 4, highest: 60 62 66 77 99
--------------------------------------------------------------------------------
VOC1
n missing distinct Info Mean Gmd .05 .10
10000 0 81 0.986 91.81 9.413 73 82
.25 .50 .75 .90 .95
91 96 98 99 99
lowest : 0 7 14 15 16, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
VOC2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 60.49 20.9 25 35
.25 .50 .75 .90 .95
50 63 74 83 87
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
VOC3
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.999 20.05 12.26 3 6
.25 .50 .75 .90 .95
12 19 27 35 39
lowest : 0 1 2 3 4, highest: 65 66 71 79 99
--------------------------------------------------------------------------------
HC1
n missing distinct Info Mean Gmd .05 .10
10000 0 32 0.99 6.949 4.361 2 3
.25 .50 .75 .90 .95
4 6 9 12 14
lowest : 0 1 2 3 4, highest: 27 28 29 30 31
--------------------------------------------------------------------------------
HC2
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.999 23.95 14.07 6 9
.25 .50 .75 .90 .95
15 22 33 43 51
lowest : 0 1 2 3 4, highest: 48 49 50 51 52
--------------------------------------------------------------------------------
HC3
n missing distinct Info Mean Gmd .05 .10
10000 0 61 0.858 2.583 3.967 0 0
.25 .50 .75 .90 .95
0 0 3 7 11
lowest : 0 1 2 3 4, highest: 79 82 91 98 99
--------------------------------------------------------------------------------
HC4
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.977 12.4 15.87 0 0
.25 .50 .75 .90 .95
0 6 17 33 48
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC5
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.993 22.59 25.09 0 0
.25 .50 .75 .90 .95
3 16 33 58 76
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC6
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 46.1 37.05 0 2
.25 .50 .75 .90 .95
16 46 73 93 98
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC7
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.998 62.43 35.86 2 10
.25 .50 .75 .90 .95
38 70 91 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC8
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 36.22 35.32 0 0
.25 .50 .75 .90 .95
8 29 59 88 96
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC9
n missing distinct Info Mean Gmd .05 .10
10000 0 73 0.62 2.918 5.112 0 0
.25 .50 .75 .90 .95
0 0 1 9 18
lowest : 0 1 2 3 4, highest: 74 76 79 81 83
--------------------------------------------------------------------------------
HC10
n missing distinct Info Mean Gmd .05 .10
10000 0 41 0.618 1.335 2.291 0 0
.25 .50 .75 .90 .95
0 0 1 4 8
lowest : 0 1 2 3 4, highest: 36 37 38 44 49
--------------------------------------------------------------------------------
HC11
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 52.17 40.61 0 1
.25 .50 .75 .90 .95
14 59 86 95 97
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC12
n missing distinct Info Mean Gmd .05 .10
10000 0 79 0.945 6.853 10.29 0 0
.25 .50 .75 .90 .95
0 1 7 24 35
lowest : 0 1 2 3 4, highest: 75 86 87 90 99
--------------------------------------------------------------------------------
HC13
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 29.8 30.42 0 2
.25 .50 .75 .90 .95
7 20 46 77 91
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC14
n missing distinct Info Mean Gmd .05 .10
10000 0 75 0.841 4.425 7.126 0 0
.25 .50 .75 .90 .95
0 0 4 15 25
lowest : 0 1 2 3 4, highest: 84 89 93 94 97
--------------------------------------------------------------------------------
HC15
n missing distinct Info Mean Gmd .05 .10
10000 0 13 0.114 0.0727 0.1418 0 0
.25 .50 .75 .90 .95
0 0 0 0 0
lowest : 0 1 2 3 4, highest: 8 9 10 14 30
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 9603 258 73 38 12 5 2 2 1 2 1
Proportion 0.960 0.026 0.007 0.004 0.001 0.000 0.000 0.000 0.000 0.000 0.000
Value 14 30
Frequency 1 2
Proportion 0.000 0.000
--------------------------------------------------------------------------------
HC16
n missing distinct Info Mean Gmd .05 .10
10000 0 80 0.905 5.238 8.123 0 0
.25 .50 .75 .90 .95
0 1 5 16 27
lowest : 0 1 2 3 4, highest: 76 77 79 81 93
--------------------------------------------------------------------------------
HC17
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.847 82.83 25.11 7.95 34.00
.25 .50 .75 .90 .95
79.75 99.00 99.00 99.00 99.00
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC18
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.875 15.09 23.46 0 0
.25 .50 .75 .90 .95
0 1 17 61 85
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC19
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.946 72.24 36.19 2 6
.25 .50 .75 .90 .95
45 95 99 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC20
n missing distinct Info Mean Gmd .05 .10
10000 0 34 0.447 97.7 2.477 95 97
.25 .50 .75 .90 .95
99 99 99 99 99
lowest : 0 46 60 62 64, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC21
n missing distinct Info Mean Gmd .05 .10
10000 0 50 0.92 94.58 6.54 83 88
.25 .50 .75 .90 .95
94 98 99 99 99
lowest : 0 32 36 48 52, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
MHUC1
n missing distinct Info Mean Gmd .05 .10
10000 0 22 0.986 8.158 3.694 4 5
.25 .50 .75 .90 .95
6 8 9 12 15
lowest : 0 1 2 3 4, highest: 17 18 19 20 21
--------------------------------------------------------------------------------
MHUC2
n missing distinct Info Mean Gmd
10000 0 6 0.789 2.339 0.853
lowest : 0 1 2 3 4, highest: 1 2 3 4 5
Value 0 1 2 3 4 5
Frequency 374 372 5786 2528 842 98
Proportion 0.037 0.037 0.579 0.253 0.084 0.010
--------------------------------------------------------------------------------
AC1
n missing distinct Info Mean Gmd .05 .10
10000 0 25 0.986 5.836 3.071 1.95 2.00
.25 .50 .75 .90 .95
4.00 6.00 7.00 9.00 11.00
lowest : 0 1 2 3 4, highest: 20 21 22 23 24
--------------------------------------------------------------------------------
AC2
n missing distinct Info Mean Gmd .05 .10
10000 0 31 0.989 5.999 3.471 1 2
.25 .50 .75 .90 .95
4 6 8 10 12
lowest : 0 1 2 3 4, highest: 26 27 28 35 36
--------------------------------------------------------------------------------
ADATE_2
n missing distinct Info Mean Gmd
10000 0 1 0 9706 0
Value 9706
Frequency 10000
Proportion 1
--------------------------------------------------------------------------------
ADATE_3
n missing distinct Info Mean Gmd
9793 207 2 0 9606 0.0004085
Value 9604 9606
Frequency 1 9792
Proportion 0 1
--------------------------------------------------------------------------------
ADATE_4
n missing distinct Info Mean Gmd
9780 220 6 0.027 9604 0.05021
lowest : 9602 9603 9604 9605 9607, highest: 9603 9604 9605 9607 9609
Value 9602 9603 9604 9605 9607 9609
Frequency 3 3 9690 26 39 19
Proportion 0.000 0.000 0.991 0.003 0.004 0.002
--------------------------------------------------------------------------------
ADATE_5
n missing distinct Info Mean Gmd
6547 3453 1 0 9604 0
Value 9604
Frequency 6547
Proportion 1
--------------------------------------------------------------------------------
ADATE_6
n missing distinct Info Mean Gmd
9635 365 2 0.002 9603 0.002904
Value 9601 9603
Frequency 7 9628
Proportion 0.001 0.999
--------------------------------------------------------------------------------
ADATE_7
n missing distinct Info Mean Gmd
9096 904 3 0.152 9602 0.2969
Value 9512 9601 9602
Frequency 10 477 8609
Proportion 0.001 0.052 0.946
--------------------------------------------------------------------------------
ADATE_8
n missing distinct Info Mean Gmd
9669 331 4 0.203 9595 11.97
Value 9511 9512 9601 9603
Frequency 6 695 8965 3
Proportion 0.001 0.072 0.927 0.000
--------------------------------------------------------------------------------
ADATE_9
n missing distinct Info Mean Gmd
8828 1172 3 0.119 9511 0.1426
Value 9509 9510 9511
Frequency 288 78 8462
Proportion 0.033 0.009 0.959
--------------------------------------------------------------------------------
ADATE_10
n missing distinct Info Mean Gmd
6659 3341 2 0.228 9510 0.1521
Value 9510 9511
Frequency 6107 552
Proportion 0.917 0.083
--------------------------------------------------------------------------------
ADATE_11
n missing distinct Info Mean Gmd
8973 1027 4 0.656 9510 0.4812
Value 9508 9509 9510 9511
Frequency 8 2364 6175 426
Proportion 0.001 0.263 0.688 0.047
--------------------------------------------------------------------------------
ADATE_12
n missing distinct Info Mean Gmd
9127 873 4 0.743 9508 0.5122
Value 9507 9508 9509 9510
Frequency 8 5166 3864 89
Proportion 0.001 0.566 0.423 0.010
--------------------------------------------------------------------------------
ADATE_13
n missing distinct Info Mean Gmd
5833 4167 3 0.033 9507 0.06536
Value 9502 9506 9507
Frequency 32 32 5769
Proportion 0.005 0.005 0.989
--------------------------------------------------------------------------------
ADATE_14
n missing distinct Info Mean Gmd
8068 1932 2 0.009 9506 0.01186
Value 9504 9506
Frequency 24 8044
Proportion 0.003 0.997
--------------------------------------------------------------------------------
ADATE_15
n missing distinct Info Mean Gmd
3203 6797 1 0 9504 0
Value 9504
Frequency 3203
Proportion 1
--------------------------------------------------------------------------------
ADATE_16
n missing distinct Info Mean Gmd
7895 2105 3 0.06 9503 0.04034
Value 9502 9503 9504
Frequency 15 7733 147
Proportion 0.002 0.979 0.019
--------------------------------------------------------------------------------
ADATE_17
n missing distinct Info Mean Gmd
7104 2896 3 0.224 9502 0.1494
Value 9501 9502 9503
Frequency 575 6527 2
Proportion 0.081 0.919 0.000
--------------------------------------------------------------------------------
ADATE_18
n missing distinct Info Mean Gmd
7784 2216 8 0.73 9464 43.19
lowest : 9409 9411 9412 9501 9502, highest: 9501 9502 9503 9504 9508
Value 9409 9411 9412 9501 9502 9503 9504 9508
Frequency 3 17 3198 4556 2 3 2 3
Proportion 0.000 0.002 0.411 0.585 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
ADATE_19
n missing distinct Info Mean Gmd
7416 2584 3 0.076 9411 0.09807
Value 9409 9410 9411
Frequency 180 13 7223
Proportion 0.024 0.002 0.974
--------------------------------------------------------------------------------
ADATE_20
n missing distinct Info Mean Gmd
4711 5289 1 0 9411 0
Value 9411
Frequency 4711
Proportion 1
--------------------------------------------------------------------------------
ADATE_21
n missing distinct Info Mean Gmd
6327 3673 2 0.217 9410 0.1445
Value 9409 9410
Frequency 496 5831
Proportion 0.078 0.922
--------------------------------------------------------------------------------
ADATE_22
n missing distinct Info Mean Gmd
7305 2695 5 0.246 9410 2.838
lowest : 9408 9409 9410 9411 9506, highest: 9408 9409 9410 9411 9506
Value 9408 9409 9410 9411 9506
Frequency 553 6647 1 1 103
Proportion 0.076 0.910 0.000 0.000 0.014
--------------------------------------------------------------------------------
ADATE_23
n missing distinct Info Mean Gmd
4126 5874 3 0.025 9407 0.199
Value 9312 9406 9407
Frequency 4 31 4091
Proportion 0.001 0.008 0.992
--------------------------------------------------------------------------------
ADATE_24
n missing distinct Info Mean Gmd
6061 3939 2 0.015 9406 0.01018
Value 9405 9406
Frequency 31 6030
Proportion 0.005 0.995
--------------------------------------------------------------------------------
RFA_2
n missing distinct
10000 0 14
lowest : L1E L1F L1G L2E L2F, highest: L3G L4D L4E L4F L4G
Value L1E L1F L1G L2E L2F L2G L3D L3E L3F L3G L4D
Frequency 515 3152 1296 509 1208 517 241 785 368 164 488
Proportion 0.052 0.315 0.130 0.051 0.121 0.052 0.024 0.078 0.037 0.016 0.049
Value L4E L4F L4G
Frequency 432 216 109
Proportion 0.043 0.022 0.011
--------------------------------------------------------------------------------
RFA_3
n missing distinct
9793 207 60
lowest : A1D A1E A1F A1G A2D, highest: S3G S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_4
n missing distinct
9763 237 56
lowest : A1D A1E A1F A1G A2D, highest: S3G S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_5
n missing distinct
6547 3453 40
lowest : A1F A1G A2E A2F A2G, highest: S3G S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_6
n missing distinct
9635 365 91
lowest : A1A A1B A1C A1D A1E, highest: S4F S4G U1D U1E U1G
--------------------------------------------------------------------------------
RFA_7
n missing distinct
9096 904 87
lowest : A1B A1C A1D A1E A1F, highest: S4C S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_8
n missing distinct
9669 331 90
lowest : A1B A1C A1D A1E A1F, highest: S4E S4F S4G U1D U1E
--------------------------------------------------------------------------------
RFA_9
n missing distinct
8828 1172 96
lowest : A1A A1B A1C A1D A1E, highest: S4D S4E S4F S4G U1G
--------------------------------------------------------------------------------
RFA_10
n missing distinct
6659 3341 76
lowest : A1C A1D A1E A1F A1G, highest: S4C S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_11
n missing distinct
8973 1027 94
lowest : A1B A1C A1D A1E A1F, highest: S4C S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_12
n missing distinct
9127 873 98
lowest : A1B A1C A1D A1E A1F, highest: S4G U1D U1E U1F U1G
--------------------------------------------------------------------------------
RFA_13
n missing distinct
5833 4167 63
lowest : A1E A1F A1G A2D A2E, highest: S4D S4E S4F S4G U1D
--------------------------------------------------------------------------------
RFA_14
n missing distinct
8068 1932 89
lowest : A1C A1D A1E A1F A1G, highest: S4F S4G U1C U1D U1G
--------------------------------------------------------------------------------
RFA_15
n missing distinct
3203 6797 33
lowest : A1G A2F A2G A3D A3E, highest: S4D S4E S4F S4G U1D
--------------------------------------------------------------------------------
RFA_16
n missing distinct
7889 2111 109
lowest : A1A A1B A1C A1D A1E, highest: U1C U1D U1E U1F U1G
--------------------------------------------------------------------------------
RFA_17
n missing distinct
7104 2896 109
lowest : A1C A1D A1E A1F A1G, highest: S4F S4G U1E U1F U1G
--------------------------------------------------------------------------------
RFA_18
n missing distinct
7784 2216 112
lowest : A1C A1D A1E A1F A1G, highest: U1C U1D U1E U1F U1G
--------------------------------------------------------------------------------
RFA_19
n missing distinct
7415 2585 101
lowest : A1A A1B A1C A1D A1E, highest: S4D S4E S4F S4G U1E
--------------------------------------------------------------------------------
RFA_20
n missing distinct
4711 5289 69
lowest : A1C A1D A1E A1F A1G, highest: S4C S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_21
n missing distinct
6327 3673 96
lowest : A1B A1C A1D A1E A1F, highest: S4C S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_22
n missing distinct
7304 2696 109
lowest : A1B A1C A1D A1E A1F, highest: S4G U1C U1D U1E U1F
--------------------------------------------------------------------------------
RFA_23
n missing distinct
4125 5875 63
lowest : 1B 1D 2D A1D A1E, highest: S4D S4E S4F S4G U1E
--------------------------------------------------------------------------------
RFA_24
n missing distinct
6061 3939 84
lowest : A1D A1E A1F A1G A2C, highest: S4G U1C U1D U1E U1G
--------------------------------------------------------------------------------
CARDPROM
n missing distinct Info Mean Gmd .05 .10
10000 0 49 0.999 18.44 9.838 6 7
.25 .50 .75 .90 .95
11 18 25 30 32
lowest : 1 2 3 4 5, highest: 45 46 48 52 53
--------------------------------------------------------------------------------
MAXADATE
n missing distinct Info Mean Gmd
10000 0 4 0.013 9702 0.7541
Value 9611 9612 9701 9702
Frequency 5 37 1 9957
Proportion 0.000 0.004 0.000 0.996
--------------------------------------------------------------------------------
NUMPROM
n missing distinct Info Mean Gmd .05 .10
10000 0 134 1 47.03 25.97 14 16
.25 .50 .75 .90 .95
27 47 64 75 82
lowest : 4 5 6 7 8, highest: 137 142 143 148 157
--------------------------------------------------------------------------------
CARDPM12
n missing distinct Info Mean Gmd .05 .10
10000 0 16 0.828 5.358 1.173 3 4
.25 .50 .75 .90 .95
5 6 6 6 6
lowest : 0 1 2 3 4, highest: 11 12 13 14 16
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 2 30 136 422 1761 1828 5437 166 91 41 23
Proportion 0.000 0.003 0.014 0.042 0.176 0.183 0.544 0.017 0.009 0.004 0.002
Value 11 12 13 14 16
Frequency 52 7 1 1 2
Proportion 0.005 0.001 0.000 0.000 0.000
--------------------------------------------------------------------------------
NUMPRM12
n missing distinct Info Mean Gmd .05 .10
10000 0 47 0.972 12.92 3.916 8 9
.25 .50 .75 .90 .95
11 12 13 15 23
lowest : 2 3 4 5 6, highest: 45 47 48 51 56
--------------------------------------------------------------------------------
RDATE_3
n missing distinct Info Mean Gmd
23 9977 7 0.942 9632 43.9
lowest : 9605 9606 9607 9609 9703, highest: 9607 9609 9703 9704 9803
Value 9605 9606 9607 9609 9703 9704 9803
Frequency 2 7 7 2 1 3 1
Proportion 0.087 0.304 0.304 0.087 0.043 0.130 0.043
--------------------------------------------------------------------------------
RDATE_4
n missing distinct Info Mean Gmd .05 .10
35 9965 14 0.973 9617 53.3 9512 9601
.25 .50 .75 .90 .95
9603 9605 9610 9704 9734
lowest : 9509 9511 9512 9601 9602, highest: 9609 9611 9704 9803 9805
Value 9509 9511 9512 9601 9602 9604 9605 9606 9607 9609 9611
Frequency 1 1 1 3 3 2 10 1 2 2 4
Proportion 0.029 0.029 0.029 0.086 0.086 0.057 0.286 0.029 0.057 0.057 0.114
Value 9704 9803 9805
Frequency 3 1 1
Proportion 0.086 0.029 0.029
--------------------------------------------------------------------------------
RDATE_6
n missing distinct Info Mean Gmd
87 9913 9 0.582 9595 19.77
lowest : 9511 9512 9601 9602 9603, highest: 9603 9604 9605 9607 9703
Value 9511 9512 9601 9602 9603 9604 9605 9607 9703
Frequency 1 8 3 1 65 6 1 1 1
Proportion 0.011 0.092 0.034 0.011 0.747 0.069 0.011 0.011 0.011
--------------------------------------------------------------------------------
RDATE_7
n missing distinct Info Mean Gmd
924 9076 4 0.597 9603 0.4101
Value 9601 9602 9603 9607
Frequency 2 250 671 1
Proportion 0.002 0.271 0.726 0.001
--------------------------------------------------------------------------------
RDATE_8
n missing distinct Info Mean Gmd
2234 7766 9 0.813 9601 1.338
lowest : 9512 9601 9602 9603 9604, highest: 9604 9605 9607 9608 9803
Value 9512 9601 9602 9603 9604 9605 9607 9608 9803
Frequency 6 1067 951 204 2 1 1 1 1
Proportion 0.003 0.478 0.426 0.091 0.001 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
RDATE_9
n missing distinct Info Mean Gmd
1737 8263 7 0.485 9525 22.16
lowest : 9509 9510 9511 9512 9601, highest: 9511 9512 9601 9602 9603
Value 9509 9510 9511 9512 9601 9602 9603
Frequency 9 65 23 1391 186 44 19
Proportion 0.005 0.037 0.013 0.801 0.107 0.025 0.011
--------------------------------------------------------------------------------
RDATE_10
n missing distinct Info Mean Gmd
1066 8934 5 0.791 9520 15.72
lowest : 9511 9512 9601 9602 9603, highest: 9511 9512 9601 9602 9603
Value 9511 9512 9601 9602 9603
Frequency 580 386 70 18 12
Proportion 0.544 0.362 0.066 0.017 0.011
--------------------------------------------------------------------------------
RDATE_11
n missing distinct Info Mean Gmd
1606 8394 8 0.861 9517 12.19
lowest : 9509 9510 9511 9512 9601, highest: 9512 9601 9602 9603 9604
Value 9509 9510 9511 9512 9601 9602 9603 9604
Frequency 5 494 762 235 76 23 10 1
Proportion 0.003 0.308 0.474 0.146 0.047 0.014 0.006 0.001
--------------------------------------------------------------------------------
RDATE_12
n missing distinct Info Mean Gmd
2662 7338 7 0.844 9513 7.813
lowest : 9509 9510 9511 9512 9601, highest: 9511 9512 9601 9602 9603
Value 9509 9510 9511 9512 9601 9602 9603
Frequency 1367 706 205 280 67 25 12
Proportion 0.514 0.265 0.077 0.105 0.025 0.009 0.005
--------------------------------------------------------------------------------
RDATE_13
n missing distinct Info Mean Gmd .05 .10
1326 8674 12 0.778 9509 2.834 9507 9507
.25 .50 .75 .90 .95
9508 9508 9508 9509 9510
lowest : 9502 9503 9504 9507 9508, highest: 9511 9512 9601 9602 9603
Value 9502 9503 9504 9507 9508 9509 9510 9511 9512 9601 9602
Frequency 4 3 2 284 789 150 42 18 20 8 3
Proportion 0.003 0.002 0.002 0.214 0.595 0.113 0.032 0.014 0.015 0.006 0.002
Value 9603
Frequency 3
Proportion 0.002
--------------------------------------------------------------------------------
RDATE_14
n missing distinct Info Mean Gmd .05 .10
2422 7578 11 0.782 9508 3.141 9506 9506
.25 .50 .75 .90 .95
9506 9506 9507 9508 9509
lowest : 9505 9506 9507 9508 9509, highest: 9511 9512 9601 9602 9603
Value 9505 9506 9507 9508 9509 9510 9511 9512 9601 9602 9603
Frequency 4 1412 655 149 87 36 26 25 11 10 7
Proportion 0.002 0.583 0.270 0.062 0.036 0.015 0.011 0.010 0.005 0.004 0.003
--------------------------------------------------------------------------------
RDATE_15
n missing distinct Info Mean Gmd .05 .10
763 9237 15 0.699 9506 3.459 9505 9505
.25 .50 .75 .90 .95
9505 9505 9506 9507 9509
lowest : 9412 9501 9502 9503 9504, highest: 9510 9511 9512 9601 9602
Value 9412 9501 9502 9503 9504 9505 9506 9507 9508 9509 9510
Frequency 2 1 2 2 3 507 150 31 24 13 8
Proportion 0.003 0.001 0.003 0.003 0.004 0.664 0.197 0.041 0.031 0.017 0.010
Value 9511 9512 9601 9602
Frequency 7 5 5 3
Proportion 0.009 0.007 0.007 0.004
--------------------------------------------------------------------------------
RDATE_16
n missing distinct Info Mean Gmd .05 .10
2851 7149 17 0.754 9505 2.838 9503 9504
.25 .50 .75 .90 .95
9504 9504 9505 9506 9507
lowest : 9411 9412 9501 9502 9503, highest: 9511 9512 9601 9602 9603
Value 9411 9412 9501 9502 9503 9504 9505 9506 9507 9508 9509
Frequency 1 6 1 2 140 1768 555 182 55 30 26
Proportion 0.000 0.002 0.000 0.001 0.049 0.620 0.195 0.064 0.019 0.011 0.009
Value 9510 9511 9512 9601 9602 9603
Frequency 21 17 28 10 6 3
Proportion 0.007 0.006 0.010 0.004 0.002 0.001
--------------------------------------------------------------------------------
RDATE_17
n missing distinct Info Mean Gmd .05 .10
939 9061 11 0.831 9503 1.275 9502 9502
.25 .50 .75 .90 .95
9502 9503 9503 9505 9506
lowest : 9502 9503 9504 9505 9506, highest: 9508 9509 9510 9511 9512
Value 9502 9503 9504 9505 9506 9507 9508 9509 9510 9511 9512
Frequency 243 499 96 40 24 10 6 5 6 3 7
Proportion 0.259 0.531 0.102 0.043 0.026 0.011 0.006 0.005 0.006 0.003 0.007
--------------------------------------------------------------------------------
RDATE_18
n missing distinct Info Mean Gmd .05 .10
2079 7921 13 0.826 9502 2.258 9501 9501
.25 .50 .75 .90 .95
9501 9501 9502 9505 9507
lowest : 9412 9501 9502 9503 9504, highest: 9508 9509 9510 9511 9512
Value 9412 9501 9502 9503 9504 9505 9506 9507 9508 9509 9510
Frequency 6 1125 520 120 77 36 47 72 29 16 12
Proportion 0.003 0.541 0.250 0.058 0.037 0.017 0.023 0.035 0.014 0.008 0.006
Value 9511 9512
Frequency 11 8
Proportion 0.005 0.004
--------------------------------------------------------------------------------
RDATE_19
n missing distinct Info Mean Gmd .05 .10
1705 8295 12 0.536 9425 22.46 9410 9412
.25 .50 .75 .90 .95
9412 9412 9412 9501 9502
lowest : 9409 9410 9411 9412 9501, highest: 9504 9505 9506 9507 9508
Value 9409 9410 9411 9412 9501 9502 9503 9504 9505 9506 9507
Frequency 2 103 36 1319 149 34 27 11 10 5 7
Proportion 0.001 0.060 0.021 0.774 0.087 0.020 0.016 0.006 0.006 0.003 0.004
Value 9508
Frequency 2
Proportion 0.001
--------------------------------------------------------------------------------
RDATE_20
n missing distinct Info Mean Gmd
841 9159 9 0.761 9422 18
lowest : 9411 9412 9501 9502 9503, highest: 9503 9504 9505 9506 9507
Value 9411 9412 9501 9502 9503 9504 9505 9506 9507
Frequency 246 503 48 18 9 10 4 2 1
Proportion 0.293 0.598 0.057 0.021 0.011 0.012 0.005 0.002 0.001
--------------------------------------------------------------------------------
RDATE_21
n missing distinct Info Mean Gmd .05 .10
1024 8976 11 0.844 9420 16.54 9410 9410
.25 .50 .75 .90 .95
9410 9411 9411 9412 9502
lowest : 9410 9411 9412 9501 9502, highest: 9504 9505 9506 9507 9508
Value 9410 9411 9412 9501 9502 9503 9504 9505 9506 9507 9508
Frequency 272 525 128 34 38 12 5 2 4 2 2
Proportion 0.266 0.513 0.125 0.033 0.037 0.012 0.005 0.002 0.004 0.002 0.002
--------------------------------------------------------------------------------
RDATE_22
n missing distinct Info Mean Gmd .05 .10
2186 7814 12 0.824 9414 8.948 9409 9409
.25 .50 .75 .90 .95
9409 9409 9410 9412 9412
lowest : 9409 9410 9411 9412 9501, highest: 9504 9505 9506 9507 9508
Value 9409 9410 9411 9412 9501 9502 9503 9504 9505 9506 9507
Frequency 1189 531 183 183 56 13 10 6 4 7 1
Proportion 0.544 0.243 0.084 0.084 0.026 0.006 0.005 0.003 0.002 0.003 0.000
Value 9508
Frequency 3
Proportion 0.001
--------------------------------------------------------------------------------
RDATE_23
n missing distinct Info Mean Gmd .05 .10
813 9187 13 0.817 9409 2.994 9407 9407
.25 .50 .75 .90 .95
9408 9408 9408 9409 9410
lowest : 9407 9408 9409 9410 9411, highest: 9503 9504 9505 9506 9507
Value 9407 9408 9409 9410 9411 9412 9501 9502 9503 9504 9505
Frequency 199 447 105 24 13 16 1 3 1 1 1
Proportion 0.245 0.550 0.129 0.030 0.016 0.020 0.001 0.004 0.001 0.001 0.001
Value 9506 9507
Frequency 1 1
Proportion 0.001 0.001
--------------------------------------------------------------------------------
RDATE_24
n missing distinct Info Mean Gmd .05 .10
1841 8159 12 0.857 9408 3.783 9406 9406
.25 .50 .75 .90 .95
9406 9407 9407 9408 9410
lowest : 9310 9406 9407 9408 9409, highest: 9412 9501 9502 9503 9504
Value 9310 9406 9407 9408 9409 9410 9411 9412 9501 9502 9503
Frequency 3 750 777 149 62 35 16 25 8 7 5
Proportion 0.002 0.407 0.422 0.081 0.034 0.019 0.009 0.014 0.004 0.004 0.003
Value 9504
Frequency 4
Proportion 0.002
--------------------------------------------------------------------------------
RAMNT_3
n missing distinct Info Mean Gmd .05 .10
23 9977 10 0.937 12.87 6.237 5.2 7.6
.25 .50 .75 .90 .95
10.0 10.0 15.0 20.0 24.5
lowest : 5 7 10 11 12, highest: 14 15 20 25 30
Value 5 7 10 11 12 14 15 20 25 30
Frequency 2 1 9 1 2 1 3 2 1 1
Proportion 0.087 0.043 0.391 0.043 0.087 0.043 0.130 0.087 0.043 0.043
--------------------------------------------------------------------------------
RAMNT_4
n missing distinct Info Mean Gmd .05 .10
35 9965 15 0.956 14.89 9.671 5.0 5.8
.25 .50 .75 .90 .95
10.0 11.0 18.0 25.6 31.5
lowest : 1 5 7 10 11, highest: 25 26 30 35 50
Value 1 5 7 10 11 12 13 15 16 20 25
Frequency 1 3 1 12 1 1 1 5 1 3 2
Proportion 0.029 0.086 0.029 0.343 0.029 0.029 0.029 0.143 0.029 0.086 0.057
Value 26 30 35 50
Frequency 1 1 1 1
Proportion 0.029 0.029 0.029 0.029
--------------------------------------------------------------------------------
RAMNT_6
n missing distinct Info Mean Gmd .05 .10
87 9913 23 0.988 14 8.937 3.3 5.0
.25 .50 .75 .90 .95
10.0 12.0 19.0 25.0 29.7
lowest : 1 3 4 5 6, highest: 25 29 30 35 50
--------------------------------------------------------------------------------
RAMNT_7
n missing distinct Info Mean Gmd .05 .10
924 9076 42 0.986 14.6 9.106 5 5
.25 .50 .75 .90 .95
10 14 20 25 28
lowest : 1 2 3 4 5, highest: 50 53 57 60 100
--------------------------------------------------------------------------------
RAMNT_8
n missing distinct Info Mean Gmd .05 .10
2234 7766 50 0.986 15.45 8.907 5 5
.25 .50 .75 .90 .95
10 15 20 25 28
lowest : 2 3 4 5 6, highest: 66 100 125 150 250
--------------------------------------------------------------------------------
RAMNT_9
n missing distinct Info Mean Gmd .05 .10
1737 8263 52 0.99 15.23 9.186 5 5
.25 .50 .75 .90 .95
10 15 20 25 27
lowest : 1.0 2.0 2.5 3.0 3.5, highest: 60.0 75.0 80.0 100.0 102.0
--------------------------------------------------------------------------------
RAMNT_10
n missing distinct Info Mean Gmd .05 .10
1066 8934 49 0.988 15.37 9.839 5 5
.25 .50 .75 .90 .95
10 13 20 25 30
lowest : 1.0 2.0 2.5 3.0 4.0, highest: 60.0 62.0 65.0 75.0 100.0
--------------------------------------------------------------------------------
RAMNT_11
n missing distinct Info Mean Gmd .05 .10
1606 8394 52 0.987 15.31 10.39 5 5
.25 .50 .75 .90 .95
10 15 20 25 30
lowest : 1.0 2.0 2.5 3.0 4.0, highest: 80.0 100.0 105.0 117.0 200.0
--------------------------------------------------------------------------------
RAMNT_12
n missing distinct Info Mean Gmd .05 .10
2662 7338 56 0.989 15.09 9.778 5 5
.25 .50 .75 .90 .95
10 13 20 25 30
lowest : 1 2 3 4 5, highest: 70 85 100 200 436
--------------------------------------------------------------------------------
RAMNT_13
n missing distinct Info Mean Gmd .05 .10
1326 8674 55 0.99 14.21 9.779 5 5
.25 .50 .75 .90 .95
8 11 20 25 30
lowest : 2.0 2.5 3.0 4.0 5.0, highest: 65.0 70.0 75.0 100.0 200.0
--------------------------------------------------------------------------------
RAMNT_14
n missing distinct Info Mean Gmd .05 .10
2422 7578 50 0.986 13.31 8.567 5 5
.25 .50 .75 .90 .95
7 11 17 25 25
lowest : 1.00 2.00 2.11 2.50 3.00, highest: 51.00 52.00 56.00 60.00 100.00
--------------------------------------------------------------------------------
RAMNT_15
n missing distinct Info Mean Gmd .05 .10
763 9237 39 0.991 13.24 9.486 4.0 5.0
.25 .50 .75 .90 .95
7.0 10.0 16.5 25.0 30.0
lowest : 2 3 4 5 6, highest: 45 50 60 75 100
--------------------------------------------------------------------------------
RAMNT_16
n missing distinct Info Mean Gmd .05 .10
2851 7149 45 0.99 13.87 8.601 5 5
.25 .50 .75 .90 .95
9 12 17 25 29
lowest : 2.0 3.0 3.5 4.0 5.0, highest: 50.0 56.0 75.0 85.0 100.0
--------------------------------------------------------------------------------
RAMNT_17
n missing distinct Info Mean Gmd .05 .10
939 9061 38 0.98 12.77 8.515 5 5
.25 .50 .75 .90 .95
7 10 15 21 25
lowest : 2 3 4 5 6, highest: 40 45 50 60 100
--------------------------------------------------------------------------------
RAMNT_18
n missing distinct Info Mean Gmd .05 .10
2079 7921 48 0.983 12.46 8.147 4 5
.25 .50 .75 .90 .95
7 10 15 21 25
lowest : 2.0 2.5 3.0 4.0 5.0, highest: 50.0 60.0 62.0 100.0 125.0
--------------------------------------------------------------------------------
RAMNT_19
n missing distinct Info Mean Gmd .05 .10
1705 8295 50 0.99 13.31 8.704 5 5
.25 .50 .75 .90 .95
7 10 16 25 25
lowest : 1 2 3 4 5, highest: 75 82 100 102 200
--------------------------------------------------------------------------------
RAMNT_20
n missing distinct Info Mean Gmd .05 .10
841 9159 42 0.99 14.11 9.156 5 5
.25 .50 .75 .90 .95
9 11 17 25 30
lowest : 2 3 4 5 6, highest: 50 60 75 82 100
--------------------------------------------------------------------------------
RAMNT_21
n missing distinct Info Mean Gmd .05 .10
1024 8976 38 0.978 12.53 8.701 4 5
.25 .50 .75 .90 .95
6 10 15 21 25
lowest : 2 3 4 5 6, highest: 40 45 50 100 200
--------------------------------------------------------------------------------
RAMNT_22
n missing distinct Info Mean Gmd .05 .10
2186 7814 51 0.984 12.48 8.289 4 5
.25 .50 .75 .90 .95
7 10 15 21 25
lowest : 1.0 2.0 2.5 3.0 4.0, highest: 60.0 62.0 70.0 75.0 100.0
--------------------------------------------------------------------------------
RAMNT_23
n missing distinct Info Mean Gmd .05 .10
813 9187 37 0.987 12.36 8.42 3.6 5.0
.25 .50 .75 .90 .95
6.0 10.0 15.0 20.0 25.0
lowest : 2 3 4 5 6, highest: 41 50 55 73 100
--------------------------------------------------------------------------------
RAMNT_24
n missing distinct Info Mean Gmd .05 .10
1841 8159 41 0.983 11.64 7.547 4 5
.25 .50 .75 .90 .95
6 10 15 20 25
lowest : 2 3 4 5 6, highest: 42 45 50 82 200
--------------------------------------------------------------------------------
RAMNTALL
n missing distinct Info Mean Gmd .05 .10
10000 0 679 1 104.8 91.9 20 25
.25 .50 .75 .90 .95
40 79 132 204 271
lowest : 15.00 15.02 16.00 17.00 17.60
highest: 1670.00 1850.00 2210.00 2211.00 2430.00
--------------------------------------------------------------------------------
NGIFTALL
n missing distinct Info Mean Gmd .05 .10
10000 0 65 0.996 9.524 8.559 1 1
.25 .50 .75 .90 .95
3 7 13 20 26
lowest : 1 2 3 4 5, highest: 64 66 69 78 79
--------------------------------------------------------------------------------
CARDGIFT
n missing distinct Info Mean Gmd .05 .10
10000 0 29 0.988 4.987 4.615 0 1
.25 .50 .75 .90 .95
2 4 7 11 14
lowest : 0 1 2 3 4, highest: 24 25 26 27 29
--------------------------------------------------------------------------------
MINRAMNT
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.94 8.02 6.795 2 3
.25 .50 .75 .90 .95
3 5 10 20 23
lowest : 0.00 0.01 0.02 0.03 0.04, highest: 100.00 105.00 125.00 200.00 436.00
--------------------------------------------------------------------------------
MINRDATE
n missing distinct Info Mean Gmd .05 .10
10000 0 130 1 9255 300.4 8801 8810
.25 .50 .75 .90 .95
9101 9309 9504 9512 9602
lowest : 8506 8507 8607 8608 8609, highest: 9610 9611 9612 9701 9702
--------------------------------------------------------------------------------
MAXRAMNT
n missing distinct Info Mean Gmd .05 .10
10000 0 97 0.982 20.07 10.94 9 10
.25 .50 .75 .90 .95
15 17 25 30 37
lowest : 5.0 6.0 7.0 7.5 8.0, highest: 300.0 400.0 436.0 500.0 1000.0
--------------------------------------------------------------------------------
MAXRDATE
n missing distinct Info Mean Gmd .05 .10
10000 0 128 0.997 9445 156.3 9101 9209
.25 .50 .75 .90 .95
9409 9507 9512 9602 9602
lowest : 8408 8606 8608 8609 8610, highest: 9610 9611 9612 9701 9702
--------------------------------------------------------------------------------
LASTGIFT
n missing distinct Info Mean Gmd .05 .10
10000 0 87 0.984 17.44 10.39 5 7
.25 .50 .75 .90 .95
10 15 20 25 30
lowest : 0.00 0.01 1.00 2.00 3.00
highest: 200.00 250.00 300.00 436.00 1000.00
--------------------------------------------------------------------------------
LASTDATE
n missing distinct Info Mean Gmd .05 .10
10000 0 24 0.984 9548 49.44 9505 9506
.25 .50 .75 .90 .95
9510 9512 9602 9603 9603
lowest : 9503 9504 9505 9506 9507, highest: 9610 9611 9612 9701 9702
--------------------------------------------------------------------------------
FISTDATE
n missing distinct Info Mean Gmd .05 .10
10000 0 128 1 9138 357.9 8610 8701
.25 .50 .75 .90 .95
8811 9202 9409 9509 9601
lowest : 8110 8304 8307 8309 8406, highest: 9511 9512 9601 9602 9603
--------------------------------------------------------------------------------
NEXTDATE
n missing distinct Info Mean Gmd .05 .10
8978 1022 132 1 9153 333.1 8703 8708
.25 .50 .75 .90 .95
8904 9204 9409 9508 9512
lowest : 8201 8405 8406 8411 8412, highest: 9608 9609 9611 9612 9701
--------------------------------------------------------------------------------
TIMELAG
n missing distinct Info Mean Gmd .05 .10
8978 1022 47 0.995 7.961 6.259 1 2
.25 .50 .75 .90 .95
4 6 11 15 20
lowest : 0 1 2 3 4, highest: 43 45 48 49 54
--------------------------------------------------------------------------------
AVGGIFT
n missing distinct Info Mean Gmd .05 .10
10000 0 2326 1 13.47 7.794 5.077 6.077
.25 .50 .75 .90 .95
8.500 11.696 15.600 21.667 25.000
lowest : 1.636364 2.037037 2.250000 2.260870 2.304348
highest: 177.142857 200.000000 220.000000 308.333333 436.000000
--------------------------------------------------------------------------------
CONTROLN
n missing distinct Info Mean Gmd .05 .10
10000 0 10000 1 95236 63996 9161 18334
.25 .50 .75 .90 .95
46671 95653 142997 172030 182307
lowest : 28 89 179 194 203, highest: 191720 191767 191773 191774 191776
--------------------------------------------------------------------------------
HPHONE_D
n missing distinct Info Sum Mean Gmd
10000 0 2 0.75 5064 0.5064 0.5
--------------------------------------------------------------------------------
RFA_2R
n missing distinct value
10000 0 1 L
Value L
Frequency 10000
Proportion 1
--------------------------------------------------------------------------------
RFA_2F
n missing distinct Info Mean Gmd
10000 0 4 0.861 1.909 1.122
Value 1 2 3 4
Frequency 4963 2234 1558 1245
Proportion 0.496 0.223 0.156 0.124
--------------------------------------------------------------------------------
RFA_2A
n missing distinct
10000 0 4
Value D E F G
Frequency 729 2241 4944 2086
Proportion 0.073 0.224 0.494 0.209
--------------------------------------------------------------------------------
MDMAUD_R
n missing distinct
10000 0 5
lowest : C D I L X, highest: C D I L X
Value C D I L X
Frequency 9 6 6 7 9972
Proportion 0.001 0.001 0.001 0.001 0.997
--------------------------------------------------------------------------------
MDMAUD_F
n missing distinct
10000 0 4
Value 1 2 5 X
Frequency 20 7 1 9972
Proportion 0.002 0.001 0.000 0.997
--------------------------------------------------------------------------------
MDMAUD_A
n missing distinct
10000 0 4
Value C L M X
Frequency 23 2 3 9972
Proportion 0.002 0.000 0.000 0.997
--------------------------------------------------------------------------------
CLUSTER2
n missing distinct Info Mean Gmd .05 .10
9985 15 62 1 31.17 21.66 4 5
.25 .50 .75 .90 .95
14 31 49 57 60
lowest : 1 2 3 4 5, highest: 58 59 60 61 62
--------------------------------------------------------------------------------
GEOCODE2
n missing distinct
9971 29 4
Value A B C D
Frequency 3662 2941 1649 1719
Proportion 0.367 0.295 0.165 0.172
--------------------------------------------------------------------------------
Variables with all observations missing:
[1] RDATE_5 RAMNT_5
# Trouver où sont les deux classes dans les données
# Pour cela on va regarder un nombre défini (exemple 100) de variables avec describe
# On regarde si jamais les premières et les dernières sont les mêmes
# Si c'est le cas alors les classes ne sont pas dans ces variables
describe(data.train[,1:100])
data.train[, 1:100]
100 Variables 10000 Observations
--------------------------------------------------------------------------------
ODATEDW
n missing distinct Info Mean Gmd .05 .10
10000 0 16 0.986 9141 390.7 8601 8601
.25 .50 .75 .90 .95
8801 9201 9501 9601 9601
lowest : 8601 8701 8801 8804 8810, highest: 9401 9501 9509 9601 9701
Value 8601 8701 8801 8804 8810 8901 9001 9101 9201 9301 9302
Frequency 1544 369 687 1 1 534 803 925 787 391 1
Proportion 0.154 0.037 0.069 0.000 0.000 0.053 0.080 0.092 0.079 0.039 0.000
Value 9401 9501 9509 9601 9701
Frequency 1325 1595 2 1034 1
Proportion 0.132 0.160 0.000 0.103 0.000
--------------------------------------------------------------------------------
OSOURCE
n missing distinct
9902 98 516
lowest : ABC ABD ABW ACB ACC, highest: WWF YAL YKA YKD YNF
--------------------------------------------------------------------------------
TCODE
n missing distinct Info Mean Gmd .05 .10
10000 0 24 0.895 51.57 100.8 0 0
.25 .50 .75 .90 .95
0 1 2 28 28
lowest : 0 1 2 3 4, highest: 1002 4002 13002 28028 72002
Value 0 200 1000 4000 13000 28000 72000
Frequency 9794 2 193 2 1 7 1
Proportion 0.979 0.000 0.019 0.000 0.000 0.001 0.000
For the frequency table, variable is rounded to the nearest 200
--------------------------------------------------------------------------------
STATE
n missing distinct
10000 0 53
lowest : AA AE AK AL AP, highest: VT WA WI WV WY
--------------------------------------------------------------------------------
ZIP
n missing distinct
10000 0 6323
lowest : 01876 02038 02124 04364 05033, highest: 99803 99821 99824 99833 99901
--------------------------------------------------------------------------------
MAILCODE
n missing distinct value
151 9849 1 B
Value B
Frequency 151
Proportion 1
--------------------------------------------------------------------------------
PVASTATE
n missing distinct
160 9840 2
Value E P
Frequency 1 159
Proportion 0.006 0.994
--------------------------------------------------------------------------------
DOB
n missing distinct Info Mean Gmd .05 .10
10000 0 779 0.985 2722 2430 0 0
.25 .50 .75 .90 .95
301 2607 4601 5601 6111
lowest : 0 1 2 101 103, highest: 7801 7906 8004 8206 8408
--------------------------------------------------------------------------------
NOEXCH
n missing distinct
10000 0 3
Value 0 1 X
Frequency 9961 34 5
Proportion 0.996 0.003 0.000
--------------------------------------------------------------------------------
RECINHSE
n missing distinct value
697 9303 1 X
Value X
Frequency 697
Proportion 1
--------------------------------------------------------------------------------
RECP3
n missing distinct value
193 9807 1 X
Value X
Frequency 193
Proportion 1
--------------------------------------------------------------------------------
RECPGVG
n missing distinct value
11 9989 1 X
Value X
Frequency 11
Proportion 1
--------------------------------------------------------------------------------
RECSWEEP
n missing distinct value
178 9822 1 X
Value X
Frequency 178
Proportion 1
--------------------------------------------------------------------------------
MDMAUD
n missing distinct
10000 0 11
lowest : C1CM C1MM C2CM D1CM D2CM, highest: I1LM I2MM L1CM L2CM XXXX
Value C1CM C1MM C2CM D1CM D2CM I1CM I1LM I2MM L1CM L2CM XXXX
Frequency 2 1 4 2 2 3 1 1 3 1 9980
Proportion 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.998
--------------------------------------------------------------------------------
DOMAIN
n missing distinct
9781 219 16
lowest : C1 C2 C3 R1 R2, highest: T3 U1 U2 U3 U4
Value C1 C2 C3 R1 R2 R3 S1 S2 S3 T1 T2
Frequency 630 868 566 129 1450 515 1200 898 221 523 1268
Proportion 0.064 0.089 0.058 0.013 0.148 0.053 0.123 0.092 0.023 0.053 0.130
Value T3 U1 U2 U3 U4
Frequency 203 467 337 274 232
Proportion 0.021 0.048 0.034 0.028 0.024
--------------------------------------------------------------------------------
CLUSTER
n missing distinct Info Mean Gmd .05 .10
9781 219 53 0.999 27.85 16.65 4 8
.25 .50 .75 .90 .95
15 28 40 47 50
lowest : 1 2 3 4 5, highest: 49 50 51 52 53
--------------------------------------------------------------------------------
AGE
n missing distinct Info Mean Gmd .05 .10
7544 2456 83 1 61.74 19.14 35 39
.25 .50 .75 .90 .95
48 62 75 83 87
lowest : 13 16 18 19 20, highest: 94 95 96 97 98
--------------------------------------------------------------------------------
AGEFLAG
n missing distinct
6942 3058 2
Value E I
Frequency 6095 847
Proportion 0.878 0.122
--------------------------------------------------------------------------------
HOMEOWNR
n missing distinct
7721 2279 2
Value H U
Frequency 5504 2217
Proportion 0.713 0.287
--------------------------------------------------------------------------------
CHILD03
n missing distinct
124 9876 3
Value B F M
Frequency 5 25 94
Proportion 0.040 0.202 0.758
--------------------------------------------------------------------------------
CHILD07
n missing distinct
149 9851 3
Value B F M
Frequency 8 38 103
Proportion 0.054 0.255 0.691
--------------------------------------------------------------------------------
CHILD12
n missing distinct
197 9803 3
Value B F M
Frequency 18 59 120
Proportion 0.091 0.299 0.609
--------------------------------------------------------------------------------
CHILD18
n missing distinct
285 9715 3
Value B F M
Frequency 20 119 146
Proportion 0.070 0.418 0.512
--------------------------------------------------------------------------------
NUMCHLD
n missing distinct Info Mean Gmd
1322 8678 5 0.733 1.53 0.7571
lowest : 1 2 3 4 5, highest: 1 2 3 4 5
Value 1 2 3 4 5
Frequency 834 329 113 38 8
Proportion 0.631 0.249 0.085 0.029 0.006
--------------------------------------------------------------------------------
INCOME
n missing distinct Info Mean Gmd
7819 2181 7 0.974 3.857 2.111
lowest : 1 2 3 4 5, highest: 3 4 5 6 7
Value 1 2 3 4 5 6 7
Frequency 960 1441 893 1327 1632 793 773
Proportion 0.123 0.184 0.114 0.170 0.209 0.101 0.099
--------------------------------------------------------------------------------
GENDER
n missing distinct
9725 275 5
lowest : C F J M U, highest: C F J M U
Value C F J M U
Frequency 1 5339 47 4164 174
Proportion 0.000 0.549 0.005 0.428 0.018
--------------------------------------------------------------------------------
WEALTH1
n missing distinct Info Mean Gmd .05 .10
5366 4634 10 0.987 5.372 3.127 1 1
.25 .50 .75 .90 .95
3 6 8 9 9
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 260 338 421 481 529 523 610 641 742 821
Proportion 0.048 0.063 0.078 0.090 0.099 0.097 0.114 0.119 0.138 0.153
--------------------------------------------------------------------------------
HIT
n missing distinct Info Mean Gmd .05 .10
10000 0 58 0.802 3.186 5.169 0 0
.25 .50 .75 .90 .95
0 0 3 10 16
lowest : 0 1 2 3 4, highest: 53 59 60 65 240
--------------------------------------------------------------------------------
MBCRAFT
n missing distinct Info Mean Gmd
4490 5510 6 0.3 0.1465 0.2665
lowest : 0 1 2 3 4, highest: 1 2 3 4 6
Value 0 1 2 3 4 6
Frequency 3986 386 88 26 3 1
Proportion 0.888 0.086 0.020 0.006 0.001 0.000
--------------------------------------------------------------------------------
MBGARDEN
n missing distinct Info Mean Gmd
4490 5510 4 0.139 0.05523 0.1057
Value 0 1 2 3
Frequency 4271 194 21 4
Proportion 0.951 0.043 0.005 0.001
--------------------------------------------------------------------------------
MBBOOKS
n missing distinct Info Mean Gmd .05 .10
4490 5510 10 0.847 1.066 1.488 0 0
.25 .50 .75 .90 .95
0 0 1 3 5
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 2314 1090 481 235 142 90 49 32 18 39
Proportion 0.515 0.243 0.107 0.052 0.032 0.020 0.011 0.007 0.004 0.009
--------------------------------------------------------------------------------
MBCOLECT
n missing distinct Info Mean Gmd
4484 5516 4 0.135 0.05531 0.1061
Value 0 1 2 3
Frequency 4273 181 23 7
Proportion 0.953 0.040 0.005 0.002
--------------------------------------------------------------------------------
MAGFAML
n missing distinct Info Mean Gmd
4490 5510 9 0.645 0.4327 0.669
lowest : 0 1 2 3 4, highest: 4 5 6 7 9
Value 0 1 2 3 4 5 6 7 9
Frequency 3151 927 279 94 29 4 4 1 1
Proportion 0.702 0.206 0.062 0.021 0.006 0.001 0.001 0.000 0.000
--------------------------------------------------------------------------------
MAGFEM
n missing distinct Info Mean Gmd
4490 5510 5 0.283 0.1209 0.2195
lowest : 0 1 2 3 4, highest: 0 1 2 3 4
Value 0 1 2 3 4
Frequency 4018 413 48 10 1
Proportion 0.895 0.092 0.011 0.002 0.000
--------------------------------------------------------------------------------
MAGMALE
n missing distinct Info Mean Gmd
4490 5510 4 0.175 0.06837 0.1289
Value 0 1 2 3
Frequency 4210 256 21 3
Proportion 0.938 0.057 0.005 0.001
--------------------------------------------------------------------------------
PUBGARDN
n missing distinct Info Mean Gmd
4490 5510 6 0.271 0.1425 0.2631
lowest : 0 1 2 3 4, highest: 1 2 3 4 5
Value 0 1 2 3 4 5
Frequency 4041 313 97 26 10 3
Proportion 0.900 0.070 0.022 0.006 0.002 0.001
--------------------------------------------------------------------------------
PUBCULIN
n missing distinct Info Mean Gmd
4490 5510 5 0.3 0.1343 0.2427
lowest : 0 1 2 3 4, highest: 0 1 2 3 4
Value 0 1 2 3 4
Frequency 3985 424 67 11 3
Proportion 0.888 0.094 0.015 0.002 0.001
--------------------------------------------------------------------------------
PUBHLTH
n missing distinct Info Mean Gmd .05 .10
4490 5510 10 0.72 0.694 1.062 0 0
.25 .50 .75 .90 .95
0 0 1 2 3
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 2913 817 350 201 112 58 24 8 5 2
Proportion 0.649 0.182 0.078 0.045 0.025 0.013 0.005 0.002 0.001 0.000
--------------------------------------------------------------------------------
PUBDOITY
n missing distinct Info Mean Gmd
4490 5510 9 0.362 0.2359 0.4265
lowest : 0 1 2 3 4, highest: 4 5 6 7 8
Value 0 1 2 3 4 5 6 7 8
Frequency 3864 398 134 36 27 19 6 2 4
Proportion 0.861 0.089 0.030 0.008 0.006 0.004 0.001 0.000 0.001
--------------------------------------------------------------------------------
PUBNEWFN
n missing distinct Info Mean Gmd .05 .10
4490 5510 10 0.509 0.3737 0.6393 0 0
.25 .50 .75 .90 .95
0 0 0 1 2
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 3538 569 216 78 44 25 8 6 2 4
Proportion 0.788 0.127 0.048 0.017 0.010 0.006 0.002 0.001 0.000 0.001
--------------------------------------------------------------------------------
PUBPHOTO
n missing distinct Info Sum Mean Gmd
4490 5510 2 0.015 23 0.005122 0.01019
--------------------------------------------------------------------------------
PUBOPP
n missing distinct Info Mean Gmd .05 .10
4490 5510 10 0.337 0.245 0.4506 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 3914 364 91 68 11 7 12 3 2 18
Proportion 0.872 0.081 0.020 0.015 0.002 0.002 0.003 0.001 0.000 0.004
--------------------------------------------------------------------------------
DATASRCE
n missing distinct Info Mean Gmd
7821 2179 3 0.766 2.487 0.6647
Value 1 2 3
Frequency 780 2455 4586
Proportion 0.100 0.314 0.586
--------------------------------------------------------------------------------
MALEMILI
n missing distinct Info Mean Gmd .05 .10
10000 0 58 0.533 1.034 1.894 0 0
.25 .50 .75 .90 .95
0 0 0 2 4
lowest : 0 1 2 3 4, highest: 92 93 94 96 97
--------------------------------------------------------------------------------
MALEVET
n missing distinct Info Mean Gmd .05 .10
10000 0 81 0.999 30.56 12.56 10 17
.25 .50 .75 .90 .95
25 31 37 44 49
lowest : 0 1 2 3 4, highest: 78 79 82 84 85
--------------------------------------------------------------------------------
VIETVETS
n missing distinct Info Mean Gmd .05 .10
10000 0 91 0.999 29.84 16.82 0 11
.25 .50 .75 .90 .95
21 30 39 49 55
lowest : 0 1 2 3 4, highest: 88 90 91 93 99
--------------------------------------------------------------------------------
WWIIVETS
n missing distinct Info Mean Gmd .05 .10
10000 0 98 1 32.56 19.64 0 10
.25 .50 .75 .90 .95
21 32 43 55 63
lowest : 0 1 2 3 4, highest: 94 95 96 97 99
--------------------------------------------------------------------------------
LOCALGOV
n missing distinct Info Mean Gmd .05 .10
10000 0 37 0.993 6.867 4.62 0 2
.25 .50 .75 .90 .95
4 6 9 12 15
lowest : 0 1 2 3 4, highest: 34 38 43 44 57
--------------------------------------------------------------------------------
STATEGOV
n missing distinct Info Mean Gmd .05 .10
10000 0 48 0.988 4.592 4.78 0 0
.25 .50 .75 .90 .95
1 3 6 10 14
lowest : 0 1 2 3 4, highest: 45 46 48 50 58
--------------------------------------------------------------------------------
FEDGOV
n missing distinct Info Mean Gmd .05 .10
10000 0 47 0.975 3.182 3.623 0 0
.25 .50 .75 .90 .95
1 2 4 7 10
lowest : 0 1 2 3 4, highest: 44 46 47 49 50
--------------------------------------------------------------------------------
SOLP3
n missing distinct Info Mean Gmd
18 9982 3 0.793 4.111 5.699
Value 0 1 12
Frequency 10 2 6
Proportion 0.556 0.111 0.333
--------------------------------------------------------------------------------
SOLIH
n missing distinct Info Mean Gmd
650 9350 5 0.177 11.3 1.321
lowest : 0 1 2 4 12, highest: 0 1 2 4 12
Value 0 1 2 4 12
Frequency 23 2 15 1 609
Proportion 0.035 0.003 0.023 0.002 0.937
--------------------------------------------------------------------------------
MAJOR
n missing distinct value
20 9980 1 X
Value X
Frequency 20
Proportion 1
--------------------------------------------------------------------------------
WEALTH2
n missing distinct Info Mean Gmd .05 .10
5423 4577 10 0.989 4.982 3.2 0 1
.25 .50 .75 .90 .95
3 5 7 9 9
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 333 419 511 554 552 573 552 613 604 712
Proportion 0.061 0.077 0.094 0.102 0.102 0.106 0.102 0.113 0.111 0.131
--------------------------------------------------------------------------------
GEOCODE
n missing distinct Info Mean Gmd
1556 8444 7 0.967 6.77 5.088
lowest : 1 2 3 4 5, highest: 3 4 5 12 14
Value 1 2 3 4 5 12 14
Frequency 112 136 339 182 203 393 191
Proportion 0.072 0.087 0.218 0.117 0.130 0.253 0.123
--------------------------------------------------------------------------------
COLLECT1
n missing distinct value
585 9415 1 Y
Value Y
Frequency 585
Proportion 1
--------------------------------------------------------------------------------
VETERANS
n missing distinct value
1085 8915 1 Y
Value Y
Frequency 1085
Proportion 1
--------------------------------------------------------------------------------
BIBLE
n missing distinct value
988 9012 1 Y
Value Y
Frequency 988
Proportion 1
--------------------------------------------------------------------------------
CATLG
n missing distinct value
850 9150 1 Y
Value Y
Frequency 850
Proportion 1
--------------------------------------------------------------------------------
HOMEE
n missing distinct value
102 9898 1 Y
Value Y
Frequency 102
Proportion 1
--------------------------------------------------------------------------------
PETS
n missing distinct value
1541 8459 1 Y
Value Y
Frequency 1541
Proportion 1
--------------------------------------------------------------------------------
CDPLAY
n missing distinct value
1325 8675 1 Y
Value Y
Frequency 1325
Proportion 1
--------------------------------------------------------------------------------
STEREO
n missing distinct value
1371 8629 1 Y
Value Y
Frequency 1371
Proportion 1
--------------------------------------------------------------------------------
PCOWNERS
n missing distinct value
1122 8878 1 Y
Value Y
Frequency 1122
Proportion 1
--------------------------------------------------------------------------------
PHOTO
n missing distinct value
523 9477 1 Y
Value Y
Frequency 523
Proportion 1
--------------------------------------------------------------------------------
CRAFTS
n missing distinct value
919 9081 1 Y
Value Y
Frequency 919
Proportion 1
--------------------------------------------------------------------------------
FISHER
n missing distinct value
762 9238 1 Y
Value Y
Frequency 762
Proportion 1
--------------------------------------------------------------------------------
GARDENIN
n missing distinct value
1387 8613 1 Y
Value Y
Frequency 1387
Proportion 1
--------------------------------------------------------------------------------
BOATS
n missing distinct value
208 9792 1 Y
Value Y
Frequency 208
Proportion 1
--------------------------------------------------------------------------------
WALKER
n missing distinct value
1054 8946 1 Y
Value Y
Frequency 1054
Proportion 1
--------------------------------------------------------------------------------
KIDSTUFF
n missing distinct value
160 9840 1 Y
Value Y
Frequency 160
Proportion 1
--------------------------------------------------------------------------------
CARDS
n missing distinct value
104 9896 1 Y
Value Y
Frequency 104
Proportion 1
--------------------------------------------------------------------------------
PLATES
n missing distinct value
47 9953 1 Y
Value Y
Frequency 47
Proportion 1
--------------------------------------------------------------------------------
LIFESRC
n missing distinct Info Mean Gmd
4398 5602 3 0.855 2.038 0.7708
Value 1 2 3
Frequency 1066 2100 1232
Proportion 0.242 0.477 0.280
--------------------------------------------------------------------------------
PEPSTRFL
n missing distinct value
4653 5347 1 X
Value X
Frequency 4653
Proportion 1
--------------------------------------------------------------------------------
POP901
n missing distinct Info Mean Gmd .05 .10
10000 0 4320 1 3279 3790 529 691
.25 .50 .75 .90 .95
987 1566 3046 6303 10811
lowest : 0 3 4 6 10, highest: 80599 80680 90832 96371 98701
--------------------------------------------------------------------------------
POP902
n missing distinct Info Mean Gmd .05 .10
10000 0 2259 1 869.6 1000 132 180
.25 .50 .75 .90 .95
263 422 824 1723 2946
lowest : 0 1 2 3 5, highest: 19094 20063 20621 21703 23766
--------------------------------------------------------------------------------
POP903
n missing distinct Info Mean Gmd .05 .10
10000 0 2659 1 1229 1413 203 269
.25 .50 .75 .90 .95
374 586 1142 2381 4032
lowest : 0 1 2 3 4, highest: 27674 28633 29306 30492 31167
--------------------------------------------------------------------------------
POP90C1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.796 58.98 47.6 0 0
.25 .50 .75 .90 .95
0 99 99 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
POP90C2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.484 13.5 23.11 0 0
.25 .50 .75 .90 .95
0 0 0 83 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
POP90C3
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.779 25.84 37.63 0 0
.25 .50 .75 .90 .95
0 0 52 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
POP90C4
n missing distinct Info Mean Gmd .05 .10
10000 0 59 0.981 48.19 3.969 43 45
.25 .50 .75 .90 .95
47 49 50 51 52
lowest : 0 14 19 21 27, highest: 79 87 89 91 99
--------------------------------------------------------------------------------
POP90C5
n missing distinct Info Mean Gmd .05 .10
10000 0 58 0.981 50.92 4.018 47 49
.25 .50 .75 .90 .95
50 51 53 55 56
lowest : 0 9 11 13 21, highest: 72 73 79 81 86
--------------------------------------------------------------------------------
ETH1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.994 84.79 18.75 36 60
.25 .50 .75 .90 .95
82 93 98 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
ETH2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.958 7.57 11.93 0 0
.25 .50 .75 .90 .95
0 1 6 21 40
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
ETH3
n missing distinct Info Mean Gmd .05 .10
10000 0 43 0.715 0.7731 1.265 0 0
.25 .50 .75 .90 .95
0 0 1 1 2
lowest : 0 1 2 3 4, highest: 92 95 96 98 99
--------------------------------------------------------------------------------
ETH4
n missing distinct Info Mean Gmd .05 .10
10000 0 74 0.924 2.827 4.27 0 0
.25 .50 .75 .90 .95
0 1 2 7 12
lowest : 0 1 2 3 4, highest: 85 87 89 90 91
--------------------------------------------------------------------------------
ETH5
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.979 7.584 10.75 0.00 0.00
.25 .50 .75 .90 .95
1.00 2.00 7.00 20.00 35.05
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
ETH6
n missing distinct Info Mean Gmd .05 .10
10000 0 12 0.395 0.2157 0.3806 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 7 8 9 10 22
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 8450 1200 226 71 28 8 6 4 2 2 2
Proportion 0.845 0.120 0.023 0.007 0.003 0.001 0.001 0.000 0.000 0.000 0.000
Value 22
Frequency 1
Proportion 0.000
--------------------------------------------------------------------------------
ETH7
n missing distinct Info Mean Gmd .05 .10
10000 0 36 0.428 0.3643 0.6632 0 0
.25 .50 .75 .90 .95
0 0 0 1 2
lowest : 0 1 2 3 4, highest: 46 53 55 57 59
--------------------------------------------------------------------------------
ETH8
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.523 0.594 1.065 0 0
.25 .50 .75 .90 .95
0 0 0 1 2
lowest : 0 1 2 3 4, highest: 36 38 40 45 61
--------------------------------------------------------------------------------
ETH9
n missing distinct Info Mean Gmd .05 .10
10000 0 35 0.492 0.5407 0.9726 0 0
.25 .50 .75 .90 .95
0 0 0 1 3
lowest : 0 1 2 3 4, highest: 42 43 48 56 58
--------------------------------------------------------------------------------
ETH10
n missing distinct Info Mean Gmd .05 .10
10000 0 18 0.362 0.2301 0.4171 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 14 17 18 22 28
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 8602 1000 228 74 32 24 14 7 6 2 3
Proportion 0.860 0.100 0.023 0.007 0.003 0.002 0.001 0.001 0.001 0.000 0.000
Value 12 13 14 17 18 22 28
Frequency 1 1 1 1 1 2 1
Proportion 0.000 0.000 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
ETH11
n missing distinct Info Mean Gmd .05 .10
10000 0 21 0.29 0.2146 0.4019 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 16 17 22 30 45
--------------------------------------------------------------------------------
ETH12
n missing distinct Info Mean Gmd .05 .10
10000 0 21 0.034 0.0566 0.1128 0 0
.25 .50 .75 .90 .95
0 0 0 0 0
lowest : 0 1 2 3 5, highest: 19 23 42 66 72
--------------------------------------------------------------------------------
ETH13
n missing distinct Info Mean Gmd .05 .10
10000 0 94 0.944 5.217 8.088 0 0
.25 .50 .75 .90 .95
0 1 4 14 26
lowest : 0 1 2 3 4, highest: 90 91 92 94 95
--------------------------------------------------------------------------------
ETH14
n missing distinct Info Mean Gmd .05 .10
10000 0 20 0.414 0.2821 0.505 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 16 17 20 26 28
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 8358 1225 186 91 51 32 11 11 8 2 6
Proportion 0.836 0.122 0.019 0.009 0.005 0.003 0.001 0.001 0.001 0.000 0.001
Value 11 12 13 14 16 17 20 26 28
Frequency 4 4 1 1 2 1 4 1 1
Proportion 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
ETH15
n missing distinct Info Mean Gmd .05 .10
10000 0 46 0.217 0.387 0.7573 0 0
.25 .50 .75 .90 .95
0 0 0 0 1
lowest : 0 1 2 3 4, highest: 68 71 72 77 78
--------------------------------------------------------------------------------
ETH16
n missing distinct Info Mean Gmd .05 .10
10000 0 45 0.887 1.515 2.175 0 0
.25 .50 .75 .90 .95
0 1 2 4 5
lowest : 0 1 2 3 4, highest: 47 48 52 61 68
--------------------------------------------------------------------------------
AGE901
n missing distinct Info Mean Gmd .05 .10
10000 0 63 0.997 34.39 7.982 26 27
.25 .50 .75 .90 .95
30 33 37 42 48
lowest : 0 14 17 19 20, highest: 74 75 76 77 79
--------------------------------------------------------------------------------
describe(data.val[,1:100])
data.val[, 1:100]
100 Variables 10000 Observations
--------------------------------------------------------------------------------
ODATEDW
n missing distinct Info Mean Gmd .05 .10
10000 0 18 0.987 9144 388.4 8601 8601
.25 .50 .75 .90 .95
8801 9201 9501 9601 9601
lowest : 8601 8610 8701 8801 8901, highest: 9410 9501 9506 9601 9701
Value 8601 8610 8701 8801 8901 8909 9001 9101 9102 9110 9201
Frequency 1441 2 392 724 579 1 798 897 1 1 817
Proportion 0.144 0.000 0.039 0.072 0.058 0.000 0.080 0.090 0.000 0.000 0.082
Value 9301 9401 9410 9501 9506 9601 9701
Frequency 424 1211 1 1680 1 1029 1
Proportion 0.042 0.121 0.000 0.168 0.000 0.103 0.000
--------------------------------------------------------------------------------
OSOURCE
n missing distinct
9901 99 542
lowest : AAA AAD ABC ABD ABW, highest: WWF WWM YKA YKD ZOY
--------------------------------------------------------------------------------
TCODE
n missing distinct Info Mean Gmd .05 .10
10000 0 20 0.895 35.93 69.44 0 0
.25 .50 .75 .90 .95
0 1 2 28 28
lowest : 0 1 2 3 4, highest: 980 1002 4002 14002 28028
Value 0 1 2 3 4 10 13 22 28 30 42
Frequency 4297 2671 1825 78 29 1 4 1 852 1 6
Proportion 0.430 0.267 0.182 0.008 0.003 0.000 0.000 0.000 0.085 0.000 0.001
Value 45 72 93 116 980 1002 4002 14002 28028
Frequency 2 9 1 2 6 207 4 1 3
Proportion 0.000 0.001 0.000 0.000 0.001 0.021 0.000 0.000 0.000
--------------------------------------------------------------------------------
STATE
n missing distinct
10000 0 53
lowest : AA AK AL AP AR, highest: VA WA WI WV WY
--------------------------------------------------------------------------------
ZIP
n missing distinct
10000 0 6277
lowest : 00778 01030- 01776- 01930 02215 , highest: 99755 99762 99801 99824 99833
--------------------------------------------------------------------------------
MAILCODE
n missing distinct value
138 9862 1 B
Value B
Frequency 138
Proportion 1
--------------------------------------------------------------------------------
PVASTATE
n missing distinct
145 9855 2
Value E P
Frequency 1 144
Proportion 0.007 0.993
--------------------------------------------------------------------------------
DOB
n missing distinct Info Mean Gmd .05 .10
10000 0 785 0.985 2703 2427 0 0
.25 .50 .75 .90 .95
201 2601 4501 5606 6201
lowest : 0 1 4 8 10, highest: 7512 7601 7908 9605 9611
--------------------------------------------------------------------------------
NOEXCH
n missing distinct
9996 4 3
Value 0 1 X
Frequency 9969 25 2
Proportion 0.997 0.003 0.000
--------------------------------------------------------------------------------
RECINHSE
n missing distinct value
748 9252 1 X
Value X
Frequency 748
Proportion 1
--------------------------------------------------------------------------------
RECP3
n missing distinct value
218 9782 1 X
Value X
Frequency 218
Proportion 1
--------------------------------------------------------------------------------
RECPGVG
n missing distinct value
15 9985 1 X
Value X
Frequency 15
Proportion 1
--------------------------------------------------------------------------------
RECSWEEP
n missing distinct value
164 9836 1 X
Value X
Frequency 164
Proportion 1
--------------------------------------------------------------------------------
MDMAUD
n missing distinct
10000 0 12
lowest : C1CM C1LM C2CM C2MM D1CM, highest: D5MM I1CM I2CM L1CM XXXX
Value C1CM C1LM C2CM C2MM D1CM D1MM D2CM D5MM I1CM I2CM L1CM
Frequency 4 2 2 1 2 1 2 1 4 2 7
Proportion 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.001
Value XXXX
Frequency 9972
Proportion 0.997
--------------------------------------------------------------------------------
DOMAIN
n missing distinct
9752 248 16
lowest : C1 C2 C3 R1 R2, highest: T3 U1 U2 U3 U4
Value C1 C2 C3 R1 R2 R3 S1 S2 S3 T1 T2
Frequency 692 845 533 135 1434 486 1190 898 182 541 1252
Proportion 0.071 0.087 0.055 0.014 0.147 0.050 0.122 0.092 0.019 0.055 0.128
Value T3 U1 U2 U3 U4
Frequency 233 512 345 294 180
Proportion 0.024 0.053 0.035 0.030 0.018
--------------------------------------------------------------------------------
CLUSTER
n missing distinct Info Mean Gmd .05 .10
9752 248 53 0.999 27.71 16.74 3 8
.25 .50 .75 .90 .95
15 27 40 47 49
lowest : 1 2 3 4 5, highest: 49 50 51 52 53
--------------------------------------------------------------------------------
AGE
n missing distinct Info Mean Gmd .05 .10
7516 2484 80 1 61.85 19.14 34 39
.25 .50 .75 .90 .95
48 63 76 83 87
lowest : 1 2 18 22 23, highest: 94 95 96 97 98
--------------------------------------------------------------------------------
AGEFLAG
n missing distinct
6827 3173 2
Value E I
Frequency 5949 878
Proportion 0.871 0.129
--------------------------------------------------------------------------------
HOMEOWNR
n missing distinct
7654 2346 2
Value H U
Frequency 5480 2174
Proportion 0.716 0.284
--------------------------------------------------------------------------------
CHILD03
n missing distinct
147 9853 3
Value B F M
Frequency 7 39 101
Proportion 0.048 0.265 0.687
--------------------------------------------------------------------------------
CHILD07
n missing distinct
135 9865 3
Value B F M
Frequency 7 38 90
Proportion 0.052 0.281 0.667
--------------------------------------------------------------------------------
CHILD12
n missing distinct
181 9819 3
Value B F M
Frequency 12 44 125
Proportion 0.066 0.243 0.691
--------------------------------------------------------------------------------
CHILD18
n missing distinct
290 9710 3
Value B F M
Frequency 35 109 146
Proportion 0.121 0.376 0.503
--------------------------------------------------------------------------------
NUMCHLD
n missing distinct Info Mean Gmd
1238 8762 5 0.732 1.519 0.7378
lowest : 1 2 3 4 5, highest: 1 2 3 4 5
Value 1 2 3 4 5
Frequency 781 312 111 27 7
Proportion 0.631 0.252 0.090 0.022 0.006
--------------------------------------------------------------------------------
INCOME
n missing distinct Info Mean Gmd
7769 2231 7 0.974 3.909 2.101
lowest : 1 2 3 4 5, highest: 3 4 5 6 7
Value 1 2 3 4 5 6 7
Frequency 912 1364 906 1308 1672 831 776
Proportion 0.117 0.176 0.117 0.168 0.215 0.107 0.100
--------------------------------------------------------------------------------
GENDER
n missing distinct
9701 299 4
Value F J M U
Frequency 5399 52 4061 189
Proportion 0.557 0.005 0.419 0.019
--------------------------------------------------------------------------------
WEALTH1
n missing distinct Info Mean Gmd .05 .10
5321 4679 10 0.987 5.385 3.133 1 1
.25 .50 .75 .90 .95
3 6 8 9 9
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 264 345 394 500 478 519 598 667 745 811
Proportion 0.050 0.065 0.074 0.094 0.090 0.098 0.112 0.125 0.140 0.152
--------------------------------------------------------------------------------
HIT
n missing distinct Info Mean Gmd .05 .10
10000 0 59 0.802 3.24 5.23 0 0
.25 .50 .75 .90 .95
0 0 3 11 17
lowest : 0 1 2 3 4, highest: 54 55 59 63 240
--------------------------------------------------------------------------------
MBCRAFT
n missing distinct Info Mean Gmd
4484 5516 5 0.306 0.1521 0.2759
lowest : 0 1 2 3 4, highest: 0 1 2 3 4
Value 0 1 2 3 4
Frequency 3968 383 106 21 6
Proportion 0.885 0.085 0.024 0.005 0.001
--------------------------------------------------------------------------------
MBGARDEN
n missing distinct Info Mean Gmd
4484 5516 4 0.152 0.05955 0.1133
Value 0 1 2 3
Frequency 4244 214 25 1
Proportion 0.946 0.048 0.006 0.000
--------------------------------------------------------------------------------
MBBOOKS
n missing distinct Info Mean Gmd .05 .10
4484 5516 10 0.86 1.101 1.506 0 0
.25 .50 .75 .90 .95
0 1 2 3 5
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 2235 1105 496 278 138 92 52 41 26 21
Proportion 0.498 0.246 0.111 0.062 0.031 0.021 0.012 0.009 0.006 0.005
--------------------------------------------------------------------------------
MBCOLECT
n missing distinct Info Mean Gmd
4482 5518 6 0.137 0.05935 0.1139
lowest : 0 1 2 3 4, highest: 1 2 3 4 5
Value 0 1 2 3 4 5
Frequency 4267 173 36 4 1 1
Proportion 0.952 0.039 0.008 0.001 0.000 0.000
--------------------------------------------------------------------------------
MAGFAML
n missing distinct Info Mean Gmd
4484 5516 7 0.646 0.4509 0.7003
lowest : 0 1 2 3 4, highest: 2 3 4 5 6
Value 0 1 2 3 4 5 6
Frequency 3150 863 316 110 30 13 2
Proportion 0.702 0.192 0.070 0.025 0.007 0.003 0.000
--------------------------------------------------------------------------------
MAGFEM
n missing distinct Info Mean Gmd
4484 5516 4 0.307 0.128 0.2291
Value 0 1 2 3
Frequency 3966 464 52 2
Proportion 0.884 0.103 0.012 0.000
--------------------------------------------------------------------------------
MAGMALE
n missing distinct Info Mean Gmd
4484 5516 4 0.159 0.06178 0.1173
Value 0 1 2 3
Frequency 4233 227 22 2
Proportion 0.944 0.051 0.005 0.000
--------------------------------------------------------------------------------
PUBGARDN
n missing distinct Info Mean Gmd
4484 5516 5 0.273 0.1405 0.2588
lowest : 0 1 2 3 4, highest: 0 1 2 3 4
Value 0 1 2 3 4
Frequency 4032 318 98 28 8
Proportion 0.899 0.071 0.022 0.006 0.002
--------------------------------------------------------------------------------
PUBCULIN
n missing distinct Info Mean Gmd
4484 5516 5 0.325 0.1476 0.264
lowest : 0 1 2 3 4, highest: 0 1 2 3 4
Value 0 1 2 3 4
Frequency 3931 457 85 9 2
Proportion 0.877 0.102 0.019 0.002 0.000
--------------------------------------------------------------------------------
PUBHLTH
n missing distinct Info Mean Gmd .05 .10
4484 5516 10 0.732 0.7268 1.105 0 0
.25 .50 .75 .90 .95
0 0 1 2 4
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 2865 833 344 201 132 66 31 8 1 3
Proportion 0.639 0.186 0.077 0.045 0.029 0.015 0.007 0.002 0.000 0.001
--------------------------------------------------------------------------------
PUBDOITY
n missing distinct Info Mean Gmd
4484 5516 9 0.359 0.2366 0.4284
lowest : 0 1 2 3 4, highest: 4 5 6 7 9
Value 0 1 2 3 4 5 6 7 9
Frequency 3865 387 130 45 26 17 10 3 1
Proportion 0.862 0.086 0.029 0.010 0.006 0.004 0.002 0.001 0.000
--------------------------------------------------------------------------------
PUBNEWFN
n missing distinct Info Mean Gmd .05 .10
4484 5516 10 0.502 0.3776 0.6479 0 0
.25 .50 .75 .90 .95
0 0 0 1 2
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 3550 526 233 90 42 22 7 5 5 4
Proportion 0.792 0.117 0.052 0.020 0.009 0.005 0.002 0.001 0.001 0.001
--------------------------------------------------------------------------------
PUBPHOTO
n missing distinct Info Sum Mean Gmd
4484 5516 2 0.015 22 0.004906 0.009767
--------------------------------------------------------------------------------
PUBOPP
n missing distinct Info Mean Gmd
4484 5516 9 0.333 0.2252 0.4127
lowest : 0 1 2 3 4, highest: 4 5 6 8 9
Value 0 1 2 3 4 5 6 8 9
Frequency 3916 357 110 54 17 10 8 3 9
Proportion 0.873 0.080 0.025 0.012 0.004 0.002 0.002 0.001 0.002
--------------------------------------------------------------------------------
DATASRCE
n missing distinct Info Mean Gmd
7770 2230 3 0.766 2.488 0.6629
Value 1 2 3
Frequency 766 2449 4555
Proportion 0.099 0.315 0.586
--------------------------------------------------------------------------------
MALEMILI
n missing distinct Info Mean Gmd .05 .10
10000 0 60 0.525 1.005 1.845 0 0
.25 .50 .75 .90 .95
0 0 0 2 4
lowest : 0 1 2 3 4, highest: 90 93 94 97 99
--------------------------------------------------------------------------------
MALEVET
n missing distinct Info Mean Gmd .05 .10
10000 0 81 0.999 30.52 12.56 10 17
.25 .50 .75 .90 .95
25 31 37 44 49
lowest : 0 1 2 3 4, highest: 83 85 86 97 99
--------------------------------------------------------------------------------
VIETVETS
n missing distinct Info Mean Gmd .05 .10
10000 0 89 0.999 29.68 16.82 0 10
.25 .50 .75 .90 .95
20 29 39 49 55
lowest : 0 1 2 3 4, highest: 85 87 89 91 99
--------------------------------------------------------------------------------
WWIIVETS
n missing distinct Info Mean Gmd .05 .10
10000 0 98 1 32.86 19.74 0 11
.25 .50 .75 .90 .95
21 32 43 56 64
lowest : 0 1 2 3 4, highest: 93 94 96 98 99
--------------------------------------------------------------------------------
LOCALGOV
n missing distinct Info Mean Gmd .05 .10
10000 0 39 0.993 6.822 4.565 0 2
.25 .50 .75 .90 .95
4 6 9 12 14
lowest : 0 1 2 3 4, highest: 34 35 37 43 60
--------------------------------------------------------------------------------
STATEGOV
n missing distinct Info Mean Gmd .05 .10
10000 0 46 0.988 4.569 4.748 0 0
.25 .50 .75 .90 .95
1 3 6 10 14
lowest : 0 1 2 3 4, highest: 41 46 47 51 99
--------------------------------------------------------------------------------
FEDGOV
n missing distinct Info Mean Gmd .05 .10
10000 0 47 0.976 3.127 3.502 0 0
.25 .50 .75 .90 .95
1 2 4 7 10
lowest : 0 1 2 3 4, highest: 46 49 54 67 99
--------------------------------------------------------------------------------
SOLP3
n missing distinct Info Mean Gmd
12 9988 3 0.839 4.333 5.939
Value 0 2 12
Frequency 6 2 4
Proportion 0.500 0.167 0.333
--------------------------------------------------------------------------------
SOLIH
n missing distinct Info Mean Gmd
687 9313 6 0.203 11.19 1.517
lowest : 0 1 2 3 4, highest: 1 2 3 4 12
Value 0 1 2 3 4 12
Frequency 29 7 10 2 2 637
Proportion 0.042 0.010 0.015 0.003 0.003 0.927
--------------------------------------------------------------------------------
MAJOR
n missing distinct value
28 9972 1 X
Value X
Frequency 28
Proportion 1
--------------------------------------------------------------------------------
WEALTH2
n missing distinct Info Mean Gmd .05 .10
5451 4549 10 0.989 4.951 3.215 0 1
.25 .50 .75 .90 .95
3 5 7 9 9
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 374 408 500 560 543 601 563 558 658 686
Proportion 0.069 0.075 0.092 0.103 0.100 0.110 0.103 0.102 0.121 0.126
--------------------------------------------------------------------------------
GEOCODE
n missing distinct Info Mean Gmd
1576 8424 7 0.967 6.978 5.12
lowest : 1 2 3 4 5, highest: 3 4 5 12 14
Value 1 2 3 4 5 12 14
Frequency 88 143 334 166 231 397 217
Proportion 0.056 0.091 0.212 0.105 0.147 0.252 0.138
--------------------------------------------------------------------------------
COLLECT1
n missing distinct value
490 9510 1 Y
Value Y
Frequency 490
Proportion 1
--------------------------------------------------------------------------------
VETERANS
n missing distinct value
1086 8914 1 Y
Value Y
Frequency 1086
Proportion 1
--------------------------------------------------------------------------------
BIBLE
n missing distinct value
892 9108 1 Y
Value Y
Frequency 892
Proportion 1
--------------------------------------------------------------------------------
CATLG
n missing distinct value
844 9156 1 Y
Value Y
Frequency 844
Proportion 1
--------------------------------------------------------------------------------
HOMEE
n missing distinct value
90 9910 1 Y
Value Y
Frequency 90
Proportion 1
--------------------------------------------------------------------------------
PETS
n missing distinct value
1453 8547 1 Y
Value Y
Frequency 1453
Proportion 1
--------------------------------------------------------------------------------
CDPLAY
n missing distinct value
1234 8766 1 Y
Value Y
Frequency 1234
Proportion 1
--------------------------------------------------------------------------------
STEREO
n missing distinct value
1294 8706 1 Y
Value Y
Frequency 1294
Proportion 1
--------------------------------------------------------------------------------
PCOWNERS
n missing distinct value
1058 8942 1 Y
Value Y
Frequency 1058
Proportion 1
--------------------------------------------------------------------------------
PHOTO
n missing distinct value
498 9502 1 Y
Value Y
Frequency 498
Proportion 1
--------------------------------------------------------------------------------
CRAFTS
n missing distinct value
784 9216 1 Y
Value Y
Frequency 784
Proportion 1
--------------------------------------------------------------------------------
FISHER
n missing distinct value
749 9251 1 Y
Value Y
Frequency 749
Proportion 1
--------------------------------------------------------------------------------
GARDENIN
n missing distinct value
1368 8632 1 Y
Value Y
Frequency 1368
Proportion 1
--------------------------------------------------------------------------------
BOATS
n missing distinct value
194 9806 1 Y
Value Y
Frequency 194
Proportion 1
--------------------------------------------------------------------------------
WALKER
n missing distinct value
1012 8988 1 Y
Value Y
Frequency 1012
Proportion 1
--------------------------------------------------------------------------------
KIDSTUFF
n missing distinct value
152 9848 1 Y
Value Y
Frequency 152
Proportion 1
--------------------------------------------------------------------------------
CARDS
n missing distinct value
103 9897 1 Y
Value Y
Frequency 103
Proportion 1
--------------------------------------------------------------------------------
PLATES
n missing distinct value
53 9947 1 Y
Value Y
Frequency 53
Proportion 1
--------------------------------------------------------------------------------
LIFESRC
n missing distinct Info Mean Gmd
4238 5762 3 0.851 2.034 0.7625
Value 1 2 3
Frequency 1016 2061 1161
Proportion 0.240 0.486 0.274
--------------------------------------------------------------------------------
PEPSTRFL
n missing distinct value
4697 5303 1 X
Value X
Frequency 4697
Proportion 1
--------------------------------------------------------------------------------
POP901
n missing distinct Info Mean Gmd .05 .10
10000 0 4333 1 3287 3767 544 706
.25 .50 .75 .90 .95
1003 1578 3058 6413 10917
lowest : 0 1 2 6 8, highest: 76571 79926 80680 85790 88442
--------------------------------------------------------------------------------
POP902
n missing distinct Info Mean Gmd .05 .10
10000 0 2279 1 874.2 997.4 140 185
.25 .50 .75 .90 .95
268 425 830 1759 2954
lowest : 0 1 2 3 4, highest: 18575 19094 20025 20621 21036
--------------------------------------------------------------------------------
POP903
n missing distinct Info Mean Gmd .05 .10
10000 0 2702 1 1240 1418 214 273
.25 .50 .75 .90 .95
382 590 1160 2410 4020
lowest : 0 1 2 3 4, highest: 27674 29306 30492 31198 33459
--------------------------------------------------------------------------------
POP90C1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.796 58.89 47.63 0 0
.25 .50 .75 .90 .95
0 99 99 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
POP90C2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.481 13.42 22.98 0 0
.25 .50 .75 .90 .95
0 0 0 82 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
POP90C3
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.782 26.17 37.92 0 0
.25 .50 .75 .90 .95
0 0 53 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
POP90C4
n missing distinct Info Mean Gmd .05 .10
10000 0 57 0.981 48.25 3.807 43 45
.25 .50 .75 .90 .95
47 49 50 51 52
lowest : 0 5 8 25 27, highest: 83 84 86 90 99
--------------------------------------------------------------------------------
POP90C5
n missing distinct Info Mean Gmd .05 .10
10000 0 57 0.981 51.06 3.847 48 49
.25 .50 .75 .90 .95
50 51 53 55 56
lowest : 0 10 14 16 17, highest: 73 75 92 95 99
--------------------------------------------------------------------------------
ETH1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.994 84.89 18.71 35 60
.25 .50 .75 .90 .95
82 93 98 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
ETH2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.958 7.531 11.9 0 0
.25 .50 .75 .90 .95
0 1 6 20 41
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
ETH3
n missing distinct Info Mean Gmd .05 .10
10000 0 44 0.698 0.7288 1.201 0 0
.25 .50 .75 .90 .95
0 0 1 1 2
lowest : 0 1 2 3 4, highest: 70 71 80 89 96
--------------------------------------------------------------------------------
ETH4
n missing distinct Info Mean Gmd .05 .10
10000 0 78 0.926 3.05 4.672 0 0
.25 .50 .75 .90 .95
0 1 3 7 13
lowest : 0 1 2 3 4, highest: 84 85 88 91 92
--------------------------------------------------------------------------------
ETH5
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.979 7.38 10.47 0 0
.25 .50 .75 .90 .95
1 2 7 19 34
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
ETH6
n missing distinct Info Mean Gmd .05 .10
10000 0 16 0.401 0.2296 0.4063 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 12 13 14 15 29
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 8421 1219 224 67 36 7 8 3 3 4 1
Proportion 0.842 0.122 0.022 0.007 0.004 0.001 0.001 0.000 0.000 0.000 0.000
Value 12 13 14 15 29
Frequency 2 1 1 2 1
Proportion 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
ETH7
n missing distinct Info Mean Gmd .05 .10
10000 0 37 0.437 0.4235 0.7757 0 0
.25 .50 .75 .90 .95
0 0 0 1 2
lowest : 0 1 2 3 4, highest: 45 47 49 50 65
--------------------------------------------------------------------------------
ETH8
n missing distinct Info Mean Gmd .05 .10
10000 0 42 0.538 0.6521 1.168 0 0
.25 .50 .75 .90 .95
0 0 0 1 3
lowest : 0 1 2 3 4, highest: 41 42 43 57 67
--------------------------------------------------------------------------------
ETH9
n missing distinct Info Mean Gmd .05 .10
10000 0 43 0.498 0.6095 1.106 0 0
.25 .50 .75 .90 .95
0 0 0 1 3
lowest : 0 1 2 3 4, highest: 54 58 59 61 67
--------------------------------------------------------------------------------
ETH10
n missing distinct Info Mean Gmd .05 .10
10000 0 19 0.366 0.2496 0.4542 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 20 21 23 24 45
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 8584 996 210 88 41 32 11 16 5 4 2
Proportion 0.858 0.100 0.021 0.009 0.004 0.003 0.001 0.002 0.000 0.000 0.000
Value 11 13 18 20 21 23 24 45
Frequency 3 1 1 1 2 1 1 1
Proportion 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
ETH11
n missing distinct Info Mean Gmd .05 .10
10000 0 22 0.294 0.2298 0.4308 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 19 21 22 24 45
--------------------------------------------------------------------------------
ETH12
n missing distinct Info Mean Gmd .05 .10
10000 0 22 0.043 0.0737 0.1466 0 0
.25 .50 .75 .90 .95
0 0 0 0 0
lowest : 0 1 2 4 5, highest: 24 25 26 48 54
--------------------------------------------------------------------------------
ETH13
n missing distinct Info Mean Gmd .05 .10
10000 0 96 0.941 5.024 7.821 0 0
.25 .50 .75 .90 .95
0 1 4 13 24
lowest : 0 1 2 3 4, highest: 91 92 93 94 96
--------------------------------------------------------------------------------
ETH14
n missing distinct Info Mean Gmd .05 .10
10000 0 25 0.421 0.3102 0.5579 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 26 28 29 31 36
--------------------------------------------------------------------------------
ETH15
n missing distinct Info Mean Gmd .05 .10
10000 0 49 0.219 0.3599 0.7031 0 0
.25 .50 .75 .90 .95
0 0 0 0 1
lowest : 0 1 2 3 4, highest: 67 68 69 70 77
--------------------------------------------------------------------------------
ETH16
n missing distinct Info Mean Gmd .05 .10
10000 0 43 0.884 1.503 2.162 0 0
.25 .50 .75 .90 .95
0 1 2 4 6
lowest : 0 1 2 3 4, highest: 40 42 43 44 61
--------------------------------------------------------------------------------
AGE901
n missing distinct Info Mean Gmd .05 .10
10000 0 66 0.996 34.72 8.055 26 28
.25 .50 .75 .90 .95
30 33 37 43 49
lowest : 0 15 17 18 19, highest: 76 77 78 79 84
--------------------------------------------------------------------------------
# Pas dans les 100 premières
describe(data.train[,101:200])
data.train[, 101:200]
100 Variables 10000 Observations
--------------------------------------------------------------------------------
AGE902
n missing distinct Info Mean Gmd .05 .10
10000 0 61 0.997 41.84 8.188 32 34
.25 .50 .75 .90 .95
38 41 45 51 56
lowest : 0 19 20 21 22, highest: 74 75 76 77 80
--------------------------------------------------------------------------------
AGE903
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.997 45.07 8.04 36 38
.25 .50 .75 .90 .95
41 44 48 54 59
lowest : 0 28 29 30 31, highest: 75 76 77 78 80
--------------------------------------------------------------------------------
AGE904
n missing distinct Info Mean Gmd .05 .10
10000 0 59 0.996 35.86 7.085 28 29
.25 .50 .75 .90 .95
32 35 39 43 47
lowest : 0 19 20 21 22, highest: 72 73 74 75 76
--------------------------------------------------------------------------------
AGE905
n missing distinct Info Mean Gmd .05 .10
10000 0 56 0.996 44.65 6.628 37 39
.25 .50 .75 .90 .95
41 45 48 51 54
lowest : 0 21 22 24 25, highest: 72 73 74 75 76
--------------------------------------------------------------------------------
AGE906
n missing distinct Info Mean Gmd .05 .10
10000 0 47 0.996 47.86 6.397 40 42
.25 .50 .75 .90 .95
45 48 51 54 57
lowest : 0 31 32 33 34, highest: 72 73 74 75 76
--------------------------------------------------------------------------------
AGE907
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.998 24.55 8.053 11 15
.25 .50 .75 .90 .95
21 26 29 33 35
lowest : 0 1 2 3 4, highest: 49 50 51 54 55
--------------------------------------------------------------------------------
CHIL1
n missing distinct Info Mean Gmd .05 .10
10000 0 66 0.998 39.56 8.412 29 32
.25 .50 .75 .90 .95
35 39 44 49 52
lowest : 0 11 13 15 16, highest: 75 78 83 86 99
--------------------------------------------------------------------------------
CHIL2
n missing distinct Info Mean Gmd .05 .10
10000 0 56 0.994 38.35 5.833 30 33
.25 .50 .75 .90 .95
36 39 42 44 45
lowest : 0 4 7 8 9, highest: 57 62 67 80 99
--------------------------------------------------------------------------------
CHIL3
n missing distinct Info Mean Gmd .05 .10
10000 0 56 0.995 21 5.582 13 15
.25 .50 .75 .90 .95
18 21 24 26 28
lowest : 0 3 4 5 6, highest: 67 68 73 75 99
--------------------------------------------------------------------------------
AGEC1
n missing distinct Info Mean Gmd .05 .10
10000 0 78 0.994 12.27 5.609 6 7
.25 .50 .75 .90 .95
9 11 14 18 21
lowest : 0 1 2 3 4, highest: 80 81 85 87 93
--------------------------------------------------------------------------------
AGEC2
n missing distinct Info Mean Gmd .05 .10
10000 0 60 0.998 22.17 8.371 11 13
.25 .50 .75 .90 .95
18 22 26 32 35
lowest : 0 1 2 3 4, highest: 55 56 58 59 60
--------------------------------------------------------------------------------
AGEC3
n missing distinct Info Mean Gmd .05 .10
10000 0 51 0.996 20.62 6.576 12 14
.25 .50 .75 .90 .95
17 20 24 28 31
lowest : 0 1 2 3 4, highest: 46 47 48 50 63
--------------------------------------------------------------------------------
AGEC4
n missing distinct Info Mean Gmd .05 .10
10000 0 36 0.993 14.05 4.66 8 9
.25 .50 .75 .90 .95
12 14 16 19 21
lowest : 0 1 2 3 4, highest: 31 32 33 34 37
--------------------------------------------------------------------------------
AGEC5
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.993 11.85 4.536 5 7
.25 .50 .75 .90 .95
9 12 14 17 19
lowest : 0 1 2 3 4, highest: 33 34 36 39 40
--------------------------------------------------------------------------------
AGEC6
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.996 10.49 6.025 3 4
.25 .50 .75 .90 .95
7 10 13 17 20
lowest : 0 1 2 3 4, highest: 49 50 52 54 61
--------------------------------------------------------------------------------
AGEC7
n missing distinct Info Mean Gmd .05 .10
10000 0 63 0.995 7.682 6.339 1 2
.25 .50 .75 .90 .95
3 6 10 15 19
lowest : 0 1 2 3 4, highest: 60 62 63 67 68
--------------------------------------------------------------------------------
CHILC1
n missing distinct Info Mean Gmd .05 .10
10000 0 45 0.994 16.15 5.19 10 11
.25 .50 .75 .90 .95
13 16 19 22 24
lowest : 0 1 3 4 5, highest: 42 44 50 60 99
--------------------------------------------------------------------------------
CHILC2
n missing distinct Info Mean Gmd .05 .10
10000 0 37 0.988 16.13 3.69 11 13
.25 .50 .75 .90 .95
14 16 18 20 21
lowest : 0 1 2 3 4, highest: 33 37 42 44 99
--------------------------------------------------------------------------------
CHILC3
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.991 31.63 4.882 25 27
.25 .50 .75 .90 .95
30 32 34 36 38
lowest : 0 1 2 4 5, highest: 54 56 57 67 99
--------------------------------------------------------------------------------
CHILC4
n missing distinct Info Mean Gmd .05 .10
10000 0 41 0.993 19.76 4.611 13 15
.25 .50 .75 .90 .95
18 20 22 24 26
lowest : 0 2 3 4 5, highest: 38 39 43 50 99
--------------------------------------------------------------------------------
CHILC5
n missing distinct Info Mean Gmd .05 .10
10000 0 70 0.991 15.24 4.781 9 11
.25 .50 .75 .90 .95
13 15 17 20 22
lowest : 0 1 2 3 4, highest: 83 89 90 97 99
--------------------------------------------------------------------------------
HHAGE1
n missing distinct Info Mean Gmd .05 .10
10000 0 98 0.999 24.51 13.82 7 10
.25 .50 .75 .90 .95
16 23 31 39 46
lowest : 0 1 2 3 4, highest: 93 94 95 96 99
--------------------------------------------------------------------------------
HHAGE2
n missing distinct Info Mean Gmd .05 .10
10000 0 69 0.997 9.393 7.474 1 2
.25 .50 .75 .90 .95
4 8 12 18 22
lowest : 0 1 2 3 4, highest: 69 70 89 91 99
--------------------------------------------------------------------------------
HHAGE3
n missing distinct Info Mean Gmd .05 .10
10000 0 97 0.999 22.21 13.64 5 7
.25 .50 .75 .90 .95
13 21 29 37 43
lowest : 0 1 2 3 4, highest: 93 94 95 96 99
--------------------------------------------------------------------------------
HHN1
n missing distinct Info Mean Gmd .05 .10
10000 0 92 0.999 22.89 12.54 8 10
.25 .50 .75 .90 .95
15 21 28 38 44
lowest : 0 1 2 3 4, highest: 90 91 93 95 99
--------------------------------------------------------------------------------
HHN2
n missing distinct Info Mean Gmd .05 .10
10000 0 74 0.998 33.5 8.479 22 25
.25 .50 .75 .90 .95
29 33 37 42 46
lowest : 0 5 7 9 10, highest: 76 80 81 82 85
--------------------------------------------------------------------------------
HHN3
n missing distinct Info Mean Gmd .05 .10
10000 0 90 0.999 42.7 16.09 16 24
.25 .50 .75 .90 .95
34 44 52 60 65
lowest : 0 1 2 3 4, highest: 85 86 87 89 99
--------------------------------------------------------------------------------
HHN4
n missing distinct Info Mean Gmd .05 .10
10000 0 75 0.999 25.6 12.23 7 12
.25 .50 .75 .90 .95
19 25 32 39 44
lowest : 0 1 2 3 4, highest: 70 71 72 76 78
--------------------------------------------------------------------------------
HHN5
n missing distinct Info Mean Gmd .05 .10
10000 0 56 0.996 10.49 6.538 2 4
.25 .50 .75 .90 .95
7 10 13 18 21
lowest : 0 1 2 3 4, highest: 54 55 56 59 61
--------------------------------------------------------------------------------
HHN6
n missing distinct Info Mean Gmd .05 .10
10000 0 39 0.977 3.916 3.309 0 1
.25 .50 .75 .90 .95
2 3 5 7 10
lowest : 0 1 2 3 4, highest: 36 40 41 43 44
--------------------------------------------------------------------------------
MARR1
n missing distinct Info Mean Gmd .05 .10
10000 0 86 0.999 57.9 13.66 33.95 42.00
.25 .50 .75 .90 .95
52.00 61.00 67.00 71.00 73.00
lowest : 0 2 3 5 7, highest: 85 87 88 90 99
--------------------------------------------------------------------------------
MARR2
n missing distinct Info Mean Gmd .05 .10
10000 0 39 0.994 10.76 4.928 4 6
.25 .50 .75 .90 .95
8 10 13 17 19
lowest : 0 1 2 3 4, highest: 35 37 44 50 52
--------------------------------------------------------------------------------
MARR3
n missing distinct Info Mean Gmd .05 .10
10000 0 47 0.993 7.449 4.908 2 3
.25 .50 .75 .90 .95
4 7 9 13 16
lowest : 0 1 2 3 4, highest: 43 44 62 67 73
--------------------------------------------------------------------------------
MARR4
n missing distinct Info Mean Gmd .05 .10
10000 0 84 0.997 23.01 8.993 13 15
.25 .50 .75 .90 .95
18 21 26 34 39
lowest : 0 2 3 4 5, highest: 88 89 93 94 97
--------------------------------------------------------------------------------
HHP1
n missing distinct Info Mean Gmd .05 .10
10000 0 326 1 185 53.59 110 132
.25 .50 .75 .90 .95
158 181 211 247 271
lowest : 0 50 53 54 55, highest: 428 445 446 456 460
--------------------------------------------------------------------------------
HHP2
n missing distinct Info Mean Gmd .05 .10
10000 0 318 1 259.5 51.98 183 206
.25 .50 .75 .90 .95
235 262 288 313 331
lowest : 0 100 105 107 109, highest: 468 469 476 490 494
--------------------------------------------------------------------------------
DW1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 69.69 27.3 15 33
.25 .50 .75 .90 .95
57 75 89 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
DW2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 65.82 29.03 10 25
.25 .50 .75 .90 .95
51 71 86 97 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
DW3
n missing distinct Info Mean Gmd .05 .10
10000 0 55 0.935 2.898 4.129 0 0
.25 .50 .75 .90 .95
0 1 3 8 12
lowest : 0 1 2 3 4, highest: 55 56 61 65 66
--------------------------------------------------------------------------------
DW4
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.994 19.59 24.12 0 0
.25 .50 .75 .90 .95
1 10 30 56 74
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
DW5
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.982 16.67 22.06 0 0
.25 .50 .75 .90 .95
0 6 25 51 69
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
DW6
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.955 13.18 18.78 0 0
.25 .50 .75 .90 .95
0 3 19 42 61
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
DW7
n missing distinct Info Mean Gmd .05 .10
10000 0 73 0.657 1.861 3.274 0 0
.25 .50 .75 .90 .95
0 0 1 5 10
lowest : 0 1 2 3 4, highest: 88 92 93 97 99
--------------------------------------------------------------------------------
DW8
n missing distinct Info Mean Gmd .05 .10
10000 0 55 0.523 1.198 2.167 0 0
.25 .50 .75 .90 .95
0 0 0 3 7
lowest : 0 1 2 3 4, highest: 73 75 77 83 88
--------------------------------------------------------------------------------
DW9
n missing distinct Info Mean Gmd .05 .10
10000 0 59 0.329 0.6562 1.258 0 0
.25 .50 .75 .90 .95
0 0 0 1 2
lowest : 0 1 2 3 4, highest: 71 77 92 97 99
--------------------------------------------------------------------------------
HV1
n missing distinct Info Mean Gmd .05 .10
10000 0 2453 1 1048 842.4 300.0 372.0
.25 .50 .75 .90 .95
515.8 736.0 1188.0 2159.6 2979.0
lowest : 0 75 98 109 120, highest: 5970 5976 5980 5983 5985
--------------------------------------------------------------------------------
HV2
n missing distinct Info Mean Gmd .05 .10
10000 0 2573 1 1117 872.3 334 409
.25 .50 .75 .90 .95
558 804 1300 2280 3137
lowest : 0 90 125 137 143, highest: 5928 5932 5933 5950 5955
--------------------------------------------------------------------------------
HV3
n missing distinct Info Mean Gmd .05 .10
10000 0 14 0.972 4.197 2.418 2 2
.25 .50 .75 .90 .95
3 4 5 7 9
lowest : 0 1 2 3 4, highest: 9 10 11 12 13
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 98 342 1863 2267 1912 1282 829 524 296 238 105
Proportion 0.010 0.034 0.186 0.227 0.191 0.128 0.083 0.052 0.030 0.024 0.011
Value 11 12 13
Frequency 101 125 18
Proportion 0.010 0.013 0.002
--------------------------------------------------------------------------------
HV4
n missing distinct Info Mean Gmd .05 .10
10000 0 14 0.977 3.86 2.426 1 1
.25 .50 .75 .90 .95
2 3 5 7 8
lowest : 0 1 2 3 4, highest: 9 10 11 12 13
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 136 1009 1967 1941 1715 1185 767 526 342 200 129
Proportion 0.014 0.101 0.197 0.194 0.172 0.118 0.077 0.053 0.034 0.020 0.013
Value 11 12 13
Frequency 62 19 2
Proportion 0.006 0.002 0.000
--------------------------------------------------------------------------------
HU1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 69.35 23.33 23 37
.25 .50 .75 .90 .95
59 75 85 92 95
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HU2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 29.73 22.59 5 8
.25 .50 .75 .90 .95
15 24 40 61 74
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HU3
n missing distinct Info Mean Gmd .05 .10
10000 0 82 0.995 89.89 10.26 69 80
.25 .50 .75 .90 .95
89 94 96 98 99
lowest : 0 17 18 20 21, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HU4
n missing distinct Info Mean Gmd .05 .10
10000 0 83 0.995 9.206 8.82 1 2
.25 .50 .75 .90 .95
3 6 11 19 28
lowest : 0 1 2 3 4, highest: 79 80 82 83 99
--------------------------------------------------------------------------------
HU5
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.968 13.48 18.94 0 0
.25 .50 .75 .90 .95
0 5 15 46 65
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HHD1
n missing distinct Info Mean Gmd .05 .10
10000 0 87 0.999 35.7 14.38 12 19
.25 .50 .75 .90 .95
28 36 44 51 56
lowest : 0 1 2 3 4, highest: 83 84 85 86 99
--------------------------------------------------------------------------------
HHD2
n missing distinct Info Mean Gmd .05 .10
10000 0 95 0.999 71.4 15.58 43 53
.25 .50 .75 .90 .95
65 75 81 86 89
lowest : 0 5 7 8 9, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HHD3
n missing distinct Info Mean Gmd .05 .10
10000 0 94 1 58.55 17.88 29 37
.25 .50 .75 .90 .95
49 61 70 77 81
lowest : 0 4 5 6 7, highest: 92 93 94 95 99
--------------------------------------------------------------------------------
HHD4
n missing distinct Info Mean Gmd .05 .10
10000 0 78 0.999 27.73 13.25 8 13
.25 .50 .75 .90 .95
20 28 35 43 48
lowest : 0 1 2 3 4, highest: 75 76 77 80 81
--------------------------------------------------------------------------------
HHD5
n missing distinct Info Mean Gmd .05 .10
10000 0 94 0.998 81.8 12.94 56 67
.25 .50 .75 .90 .95
79 86 90 93 94
lowest : 0 2 3 8 9, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HHD6
n missing distinct Info Mean Gmd .05 .10
10000 0 95 0.998 17.31 11.79 5 7
.25 .50 .75 .90 .95
10 14 21 31 41
lowest : 0 1 2 3 4, highest: 91 92 97 98 99
--------------------------------------------------------------------------------
HHD7
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.993 7.973 5.264 2 3
.25 .50 .75 .90 .95
5 7 10 14 17
lowest : 0 1 2 3 4, highest: 48 50 57 66 99
--------------------------------------------------------------------------------
HHD8
n missing distinct Info Mean Gmd .05 .10
10000 0 13 0.898 1.641 1.126 0 0
.25 .50 .75 .90 .95
1 1 2 3 4
lowest : 0 1 2 3 4, highest: 8 9 10 11 12
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 1046 3993 3306 1125 351 104 44 15 9 4 1
Proportion 0.105 0.399 0.331 0.112 0.035 0.010 0.004 0.002 0.001 0.000 0.000
Value 11 12
Frequency 1 1
Proportion 0.000 0.000
--------------------------------------------------------------------------------
HHD9
n missing distinct Info Mean Gmd .05 .10
10000 0 49 0.99 6.334 4.504 2 2
.25 .50 .75 .90 .95
4 5 8 11 15
lowest : 0 1 2 3 4, highest: 45 48 55 62 99
--------------------------------------------------------------------------------
HHD10
n missing distinct Info Mean Gmd .05 .10
10000 0 62 0.996 13.65 7.141 5 7
.25 .50 .75 .90 .95
9 12 16 22 27
lowest : 0 1 2 3 4, highest: 60 62 64 67 99
--------------------------------------------------------------------------------
HHD11
n missing distinct Info Mean Gmd .05 .10
10000 0 72 0.999 18.92 10.16 6 9
.25 .50 .75 .90 .95
12 18 24 31 36
lowest : 0 1 2 3 4, highest: 69 71 78 85 99
--------------------------------------------------------------------------------
HHD12
n missing distinct Info Mean Gmd .05 .10
10000 0 48 0.983 4.804 3.806 1 1
.25 .50 .75 .90 .95
2 4 6 9 12
lowest : 0 1 2 3 4, highest: 50 53 56 58 67
--------------------------------------------------------------------------------
ETHC1
n missing distinct Info Mean Gmd .05 .10
10000 0 42 0.998 16.76 7.561 3 7
.25 .50 .75 .90 .95
13 18 22 25 26
lowest : 0 1 2 3 4, highest: 37 38 39 41 46
--------------------------------------------------------------------------------
ETHC2
n missing distinct Info Mean Gmd .05 .10
10000 0 90 0.999 50.74 14.63 18 33
.25 .50 .75 .90 .95
46 54 60 64 66
lowest : 0 1 2 3 4, highest: 85 86 89 90 92
--------------------------------------------------------------------------------
ETHC3
n missing distinct Info Mean Gmd .05 .10
10000 0 97 0.999 17.32 12.12 3 5
.25 .50 .75 .90 .95
10 15 22 30 38
lowest : 0 1 2 3 4, highest: 93 94 95 96 97
--------------------------------------------------------------------------------
ETHC4
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.812 1.993 3.254 0 0
.25 .50 .75 .90 .95
0 0 1 6 11
lowest : 0 1 2 3 4, highest: 33 34 35 37 39
--------------------------------------------------------------------------------
ETHC5
n missing distinct Info Mean Gmd .05 .10
10000 0 72 0.929 4.653 7.331 0 0
.25 .50 .75 .90 .95
0 1 4 13 25
lowest : 0 1 2 3 4, highest: 67 68 71 75 83
--------------------------------------------------------------------------------
ETHC6
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.515 0.8284 1.495 0 0
.25 .50 .75 .90 .95
0 0 0 2 5
lowest : 0 1 2 3 4, highest: 40 42 43 54 81
--------------------------------------------------------------------------------
HVP1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.922 13.13 21.19 0 0
.25 .50 .75 .90 .95
0 1 9 59 88
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HVP2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.981 20.57 29.6 0 0
.25 .50 .75 .90 .95
0 3 26 85 96
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HVP3
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.998 34.77 39.41 0 1
.25 .50 .75 .90 .95
4 17 67 97 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HVP4
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.998 51.01 41.67 1 4
.25 .50 .75 .90 .95
15 48 92 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HVP5
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.979 73.92 31.36 14 25
.25 .50 .75 .90 .95
53 89 99 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HVP6
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.724 6.102 10.88 0 0
.25 .50 .75 .90 .95
0 0 1 16 49
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HUR1
n missing distinct Info Mean Gmd .05 .10
10000 0 72 0.983 4.94 6.197 0 0
.25 .50 .75 .90 .95
1 2 6 12 19
lowest : 0 1 2 3 4, highest: 76 78 89 95 99
--------------------------------------------------------------------------------
HUR2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 45.55 24.32 11 18
.25 .50 .75 .90 .95
31 43 59 76 85
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
RHP1
n missing distinct Info Mean Gmd .05 .10
10000 0 72 0.999 52.84 11.31 37 42
.25 .50 .75 .90 .95
48 52 58 66 71
lowest : 0 5 8 11 13, highest: 80 81 82 83 84
--------------------------------------------------------------------------------
RHP2
n missing distinct Info Mean Gmd .05 .10
10000 0 74 0.999 53.64 10.9 38 43
.25 .50 .75 .90 .95
49 54 59 66 70
lowest : 0 10 13 14 15, highest: 83 84 85 86 88
--------------------------------------------------------------------------------
RHP3
n missing distinct Info Mean Gmd .05 .10
10000 0 20 0.975 14.13 2.567 11 12
.25 .50 .75 .90 .95
13 14 15 17 18
lowest : 0 10 11 12 13, highest: 24 25 26 29 30
Value 0 10 11 12 13 14 15 16 17 18 19
Frequency 89 406 442 939 1822 2229 1808 1068 525 283 137
Proportion 0.009 0.041 0.044 0.094 0.182 0.223 0.181 0.107 0.052 0.028 0.014
Value 20 21 22 23 24 25 26 29 30
Frequency 103 71 37 18 10 9 2 1 1
Proportion 0.010 0.007 0.004 0.002 0.001 0.001 0.000 0.000 0.000
--------------------------------------------------------------------------------
RHP4
n missing distinct Info Mean Gmd .05 .10
10000 0 16 0.859 4.399 1.095 3 3
.25 .50 .75 .90 .95
4 4 5 5 6
lowest : 0 1 2 3 4, highest: 11 12 13 14 15
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 92 27 120 979 4784 3124 517 171 88 36 24
Proportion 0.009 0.003 0.012 0.098 0.478 0.312 0.052 0.017 0.009 0.004 0.002
Value 11 12 13 14 15
Frequency 17 8 7 4 2
Proportion 0.002 0.001 0.001 0.000 0.000
--------------------------------------------------------------------------------
HUPA1
n missing distinct Info Mean Gmd .05 .10
10000 0 90 0.991 10.25 12.42 0 0
.25 .50 .75 .90 .95
1 6 15 27 36
lowest : 0 1 2 3 4, highest: 90 91 92 93 99
--------------------------------------------------------------------------------
HUPA2
n missing distinct Info Mean Gmd .05 .10
10000 0 99 0.897 9.324 14.33 0 0
.25 .50 .75 .90 .95
0 1 11 31 48
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HUPA3
n missing distinct Info Mean Gmd .05 .10
10000 0 99 0.868 8.782 13.1 0 0
.25 .50 .75 .90 .95
0 0 13 28 38
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HUPA4
n missing distinct Info Mean Gmd .05 .10
10000 0 70 0.998 11.17 8.028 2 3
.25 .50 .75 .90 .95
6 10 15 21 25
lowest : 0 1 2 3 4, highest: 82 89 92 98 99
--------------------------------------------------------------------------------
HUPA5
n missing distinct Info Mean Gmd .05 .10
10000 0 60 0.977 5.288 6.829 0 0
.25 .50 .75 .90 .95
0 2 7 14 20
lowest : 0 1 2 3 4, highest: 60 67 69 78 99
--------------------------------------------------------------------------------
HUPA6
n missing distinct Info Mean Gmd .05 .10
10000 0 99 0.944 11.18 16.17 0 0
.25 .50 .75 .90 .95
0 3 15 36 53
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HUPA7
n missing distinct Info Mean Gmd .05 .10
10000 0 37 0.799 1.641 2.493 0 0
.25 .50 .75 .90 .95
0 0 2 5 7
lowest : 0 1 2 3 4, highest: 35 37 38 41 99
--------------------------------------------------------------------------------
RP1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.995 28.78 34.3 0.00 0.00
.25 .50 .75 .90 .95
2.00 13.00 53.25 83.00 92.00
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
RP2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 42.16 39.54 0 1
.25 .50 .75 .90 .95
7 35 77 93 96
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
RP3
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 59.65 37.19 4 9
.25 .50 .75 .90 .95
29 69 91 97 98
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
RP4
n missing distinct Info Mean Gmd .05 .10
10000 0 99 0.999 76.45 25.54 23 37
.25 .50 .75 .90 .95
64 86 95 98 99
lowest : 0 2 3 4 5, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
MSA
n missing distinct Info Mean Gmd .05 .10
9989 11 257 0.989 3544 3269 0 0
.25 .50 .75 .90 .95
520 3360 5960 7480 8280
lowest : 0 40 120 200 220, highest: 9200 9260 9270 9340 9360
--------------------------------------------------------------------------------
ADI
n missing distinct Info Mean Gmd .05 .10
9989 11 183 0.999 189.2 155.1 13 15
.25 .50 .75 .90 .95
67 179 281 383 435
lowest : 0 3 9 11 13, highest: 613 621 625 627 645
--------------------------------------------------------------------------------
DMA
n missing distinct Info Mean Gmd .05 .10
9989 11 186 0.999 664.7 132.3 513 524
.25 .50 .75 .90 .95
561 635 803 819 839
lowest : 0 500 501 503 504, highest: 855 862 866 868 881
--------------------------------------------------------------------------------
IC1
n missing distinct Info Mean Gmd .05 .10
10000 0 806 1 337.9 167 147.0 179.0
.25 .50 .75 .90 .95
231.0 309.0 412.0 531.1 624.0
lowest : 0 50 54 56 59, highest: 1356 1368 1386 1395 1500
--------------------------------------------------------------------------------
IC2
n missing distinct Info Mean Gmd .05 .10
10000 0 865 1 384.7 176.9 185.0 219.0
.25 .50 .75 .90 .95
273.0 354.0 463.2 586.0 679.0
lowest : 0 50 54 56 59, highest: 1434 1442 1493 1496 1500
--------------------------------------------------------------------------------
describe(data.val[,101:200])
data.val[, 101:200]
100 Variables 10000 Observations
--------------------------------------------------------------------------------
AGE902
n missing distinct Info Mean Gmd .05 .10
10000 0 63 0.997 42.13 8.149 33 35
.25 .50 .75 .90 .95
38 41 45 51 57
lowest : 0 19 20 21 22, highest: 76 77 78 79 84
--------------------------------------------------------------------------------
AGE903
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.997 45.27 8.034 36 38
.25 .50 .75 .90 .95
41 44 49 54 60
lowest : 0 29 30 31 32, highest: 77 78 79 80 84
--------------------------------------------------------------------------------
AGE904
n missing distinct Info Mean Gmd .05 .10
10000 0 62 0.996 36.11 7.127 28 29
.25 .50 .75 .90 .95
32 35 39 43 48
lowest : 0 18 19 20 21, highest: 74 75 76 77 78
--------------------------------------------------------------------------------
AGE905
n missing distinct Info Mean Gmd .05 .10
10000 0 57 0.996 44.87 6.564 37 39
.25 .50 .75 .90 .95
42 45 48 51 55
lowest : 0 23 24 25 26, highest: 74 75 76 77 78
--------------------------------------------------------------------------------
AGE906
n missing distinct Info Mean Gmd .05 .10
10000 0 48 0.996 48.04 6.338 40 42
.25 .50 .75 .90 .95
45 48 51 54 57
lowest : 0 32 33 34 35, highest: 74 75 76 77 78
--------------------------------------------------------------------------------
AGE907
n missing distinct Info Mean Gmd .05 .10
10000 0 52 0.998 24.43 8.094 10 15
.25 .50 .75 .90 .95
21 25 29 33 35
lowest : 0 1 2 3 4, highest: 47 48 50 51 55
--------------------------------------------------------------------------------
CHIL1
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.998 39.66 8.333 29 32
.25 .50 .75 .90 .95
35 39 44 49 52
lowest : 0 13 14 15 16, highest: 78 80 82 84 99
--------------------------------------------------------------------------------
CHIL2
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.994 38.41 5.688 30 33
.25 .50 .75 .90 .95
36 39 42 44 45
lowest : 0 6 10 11 13, highest: 60 61 67 75 99
--------------------------------------------------------------------------------
CHIL3
n missing distinct Info Mean Gmd .05 .10
10000 0 52 0.995 21.04 5.548 13 15
.25 .50 .75 .90 .95
18 21 24 26 28
lowest : 0 5 6 7 8, highest: 60 63 67 70 99
--------------------------------------------------------------------------------
AGEC1
n missing distinct Info Mean Gmd .05 .10
10000 0 73 0.993 12.04 5.339 5 7
.25 .50 .75 .90 .95
9 11 14 17 20
lowest : 0 1 2 3 4, highest: 76 77 83 89 90
--------------------------------------------------------------------------------
AGEC2
n missing distinct Info Mean Gmd .05 .10
10000 0 61 0.998 22.14 8.446 10 13
.25 .50 .75 .90 .95
18 22 26 32 36
lowest : 0 1 2 3 4, highest: 56 57 58 61 62
--------------------------------------------------------------------------------
AGEC3
n missing distinct Info Mean Gmd .05 .10
10000 0 52 0.996 20.7 6.591 12 14
.25 .50 .75 .90 .95
17 20 24 28 32
lowest : 0 1 2 3 4, highest: 48 50 55 58 99
--------------------------------------------------------------------------------
AGEC4
n missing distinct Info Mean Gmd .05 .10
10000 0 35 0.993 14.13 4.65 8 9
.25 .50 .75 .90 .95
12 14 16 19 21
lowest : 0 1 2 3 4, highest: 30 31 32 33 34
--------------------------------------------------------------------------------
AGEC5
n missing distinct Info Mean Gmd .05 .10
10000 0 36 0.993 11.91 4.507 5 7
.25 .50 .75 .90 .95
9 12 14 17 19
lowest : 0 1 2 3 4, highest: 31 32 34 36 99
--------------------------------------------------------------------------------
AGEC6
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.996 10.64 6.156 3 4
.25 .50 .75 .90 .95
7 10 13 17 21
lowest : 0 1 2 3 4, highest: 49 50 53 54 55
--------------------------------------------------------------------------------
AGEC7
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.995 7.765 6.466 1 2
.25 .50 .75 .90 .95
3 6 10 15 20
lowest : 0 1 2 3 4, highest: 65 69 72 78 83
--------------------------------------------------------------------------------
CHILC1
n missing distinct Info Mean Gmd .05 .10
10000 0 48 0.994 16.24 5.164 10 11
.25 .50 .75 .90 .95
13 16 19 22 25
lowest : 0 2 4 5 6, highest: 48 50 51 60 75
--------------------------------------------------------------------------------
CHILC2
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.988 16.15 3.625 11 13
.25 .50 .75 .90 .95
14 16 18 20 21
lowest : 0 3 4 5 6, highest: 40 50 60 75 99
--------------------------------------------------------------------------------
CHILC3
n missing distinct Info Mean Gmd .05 .10
10000 0 50 0.991 31.73 4.736 25 27
.25 .50 .75 .90 .95
30 32 34 36 38
lowest : 0 2 5 7 8, highest: 50 52 63 67 99
--------------------------------------------------------------------------------
CHILC4
n missing distinct Info Mean Gmd .05 .10
10000 0 45 0.993 19.74 4.583 13 15
.25 .50 .75 .90 .95
18 20 22 24 26
lowest : 0 1 3 4 5, highest: 44 50 52 60 75
--------------------------------------------------------------------------------
CHILC5
n missing distinct Info Mean Gmd .05 .10
10000 0 66 0.991 15.26 4.636 9 11
.25 .50 .75 .90 .95
13 15 17 19 21
lowest : 0 1 3 4 5, highest: 75 80 89 92 99
--------------------------------------------------------------------------------
HHAGE1
n missing distinct Info Mean Gmd .05 .10
10000 0 97 0.999 24.79 14.1 7 10
.25 .50 .75 .90 .95
16 24 32 39 47
lowest : 0 1 2 3 4, highest: 92 93 94 95 99
--------------------------------------------------------------------------------
HHAGE2
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.997 9.41 7.462 1 2
.25 .50 .75 .90 .95
4 8 12 17 22
lowest : 0 1 2 3 4, highest: 67 68 69 70 72
--------------------------------------------------------------------------------
HHAGE3
n missing distinct Info Mean Gmd .05 .10
10000 0 96 0.999 22.49 13.91 5 8
.25 .50 .75 .90 .95
14 21 29 37 44
lowest : 0 1 2 3 4, highest: 91 92 93 94 99
--------------------------------------------------------------------------------
HHN1
n missing distinct Info Mean Gmd .05 .10
10000 0 90 0.999 22.87 12.56 8 10
.25 .50 .75 .90 .95
15 21 28 37 46
lowest : 0 1 2 3 4, highest: 88 89 91 93 99
--------------------------------------------------------------------------------
HHN2
n missing distinct Info Mean Gmd .05 .10
10000 0 72 0.997 33.73 8.505 22 25
.25 .50 .75 .90 .95
30 33 38 43 47
lowest : 0 6 7 8 9, highest: 74 78 79 85 99
--------------------------------------------------------------------------------
HHN3
n missing distinct Info Mean Gmd .05 .10
10000 0 89 0.999 42.67 16.18 16 24
.25 .50 .75 .90 .95
35 44 52 60 65
lowest : 0 1 2 3 4, highest: 85 86 87 89 98
--------------------------------------------------------------------------------
HHN4
n missing distinct Info Mean Gmd .05 .10
10000 0 73 0.999 25.61 12.33 7 11
.25 .50 .75 .90 .95
19 26 32 39 44
lowest : 0 1 2 3 4, highest: 68 70 73 76 92
--------------------------------------------------------------------------------
HHN5
n missing distinct Info Mean Gmd .05 .10
10000 0 58 0.996 10.45 6.584 2 4
.25 .50 .75 .90 .95
7 10 13 17 22
lowest : 0 1 2 3 4, highest: 54 59 60 61 69
--------------------------------------------------------------------------------
HHN6
n missing distinct Info Mean Gmd .05 .10
10000 0 42 0.977 3.915 3.365 0.00 1.00
.25 .50 .75 .90 .95
2.00 3.00 5.00 7.00 10.05
lowest : 0 1 2 3 4, highest: 38 40 41 44 50
--------------------------------------------------------------------------------
MARR1
n missing distinct Info Mean Gmd .05 .10
10000 0 87 0.999 58.27 13.57 34 42
.25 .50 .75 .90 .95
52 61 67 71 74
lowest : 0 1 2 7 8, highest: 87 88 89 90 99
--------------------------------------------------------------------------------
MARR2
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.994 10.67 4.876 4 6
.25 .50 .75 .90 .95
8 10 13 17 19
lowest : 0 1 2 3 4, highest: 40 46 47 50 99
--------------------------------------------------------------------------------
MARR3
n missing distinct Info Mean Gmd .05 .10
10000 0 48 0.993 7.473 4.921 2 3
.25 .50 .75 .90 .95
4 7 9 13 16
lowest : 0 1 2 3 4, highest: 45 48 50 54 60
--------------------------------------------------------------------------------
MARR4
n missing distinct Info Mean Gmd .05 .10
10000 0 81 0.997 22.9 8.944 13 15
.25 .50 .75 .90 .95
18 21 26 34 39
lowest : 0 1 2 3 4, highest: 80 85 91 93 98
--------------------------------------------------------------------------------
HHP1
n missing distinct Info Mean Gmd .05 .10
10000 0 323 1 185.4 53.55 108 132
.25 .50 .75 .90 .95
159 181 211 248 272
lowest : 0 50 54 55 56, highest: 424 448 453 460 497
--------------------------------------------------------------------------------
HHP2
n missing distinct Info Mean Gmd .05 .10
10000 0 319 1 259.9 51.73 181 204
.25 .50 .75 .90 .95
236 262 287 314 331
lowest : 0 100 110 111 113, highest: 458 482 487 490 495
--------------------------------------------------------------------------------
DW1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 70.31 27.22 16 32
.25 .50 .75 .90 .95
58 76 90 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
DW2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 66.1 29.35 9 24
.25 .50 .75 .90 .95
51 72 87 97 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
DW3
n missing distinct Info Mean Gmd .05 .10
10000 0 57 0.934 2.86 4.087 0 0
.25 .50 .75 .90 .95
0 1 3 8 12
lowest : 0 1 2 3 4, highest: 58 59 61 62 63
--------------------------------------------------------------------------------
DW4
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.994 19.4 24.16 0 0
.25 .50 .75 .90 .95
1 9 30 57 74
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
DW5
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.981 16.52 22.07 0 0
.25 .50 .75 .90 .95
0 6 24 51 69
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
DW6
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.954 13.05 18.77 0 0
.25 .50 .75 .90 .95
0 3 18 42 60
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
DW7
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.655 1.783 3.134 0 0
.25 .50 .75 .90 .95
0 0 1 5 10
lowest : 0 1 2 3 4, highest: 75 78 87 90 99
--------------------------------------------------------------------------------
DW8
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.518 1.175 2.127 0 0
.25 .50 .75 .90 .95
0 0 0 3 7
lowest : 0 1 2 3 4, highest: 66 70 74 78 90
--------------------------------------------------------------------------------
DW9
n missing distinct Info Mean Gmd .05 .10
10000 0 51 0.331 0.6012 1.149 0 0
.25 .50 .75 .90 .95
0 0 0 1 2
lowest : 0 1 2 3 4, highest: 65 67 81 87 99
--------------------------------------------------------------------------------
HV1
n missing distinct Info Mean Gmd .05 .10
10000 0 2494 1 1080 873.8 313 376
.25 .50 .75 .90 .95
519 749 1271 2274 3021
lowest : 0 75 126 128 130, highest: 5965 5975 5985 5986 6000
--------------------------------------------------------------------------------
HV2
n missing distinct Info Mean Gmd .05 .10
10000 0 2622 1 1151 905.5 349.0 413.9
.25 .50 .75 .90 .95
559.0 816.0 1381.2 2397.2 3158.0
lowest : 0 125 162 163 168, highest: 5900 5934 5953 5955 6000
--------------------------------------------------------------------------------
HV3
n missing distinct Info Mean Gmd .05 .10
10000 0 14 0.973 4.281 2.477 2 2
.25 .50 .75 .90 .95
3 4 5 7 9
lowest : 0 1 2 3 4, highest: 9 10 11 12 13
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 87 321 1819 2257 1833 1284 900 527 328 258 116
Proportion 0.009 0.032 0.182 0.226 0.183 0.128 0.090 0.053 0.033 0.026 0.012
Value 11 12 13
Frequency 121 131 18
Proportion 0.012 0.013 0.002
--------------------------------------------------------------------------------
HV4
n missing distinct Info Mean Gmd .05 .10
10000 0 14 0.978 3.918 2.479 1 1
.25 .50 .75 .90 .95
2 4 5 7 8
lowest : 0 1 2 3 4, highest: 9 10 11 12 13
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 127 1032 1885 1941 1674 1161 836 529 353 229 140
Proportion 0.013 0.103 0.188 0.194 0.167 0.116 0.084 0.053 0.035 0.023 0.014
Value 11 12 13
Frequency 72 15 6
Proportion 0.007 0.002 0.001
--------------------------------------------------------------------------------
HU1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 70 23.08 23 38
.25 .50 .75 .90 .95
60 76 85 92 95
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HU2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 29.26 22.48 5 8
.25 .50 .75 .90 .95
14 24 39 60 74
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HU3
n missing distinct Info Mean Gmd .05 .10
10000 0 87 0.995 90.17 9.906 69 80
.25 .50 .75 .90 .95
89 94 97 98 99
lowest : 0 4 12 14 16, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HU4
n missing distinct Info Mean Gmd .05 .10
10000 0 88 0.995 9.105 8.753 1 2
.25 .50 .75 .90 .95
3 6 11 19 28
lowest : 0 1 2 3 4, highest: 84 86 88 96 99
--------------------------------------------------------------------------------
HU5
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.967 13.52 19.01 0 0
.25 .50 .75 .90 .95
0 4 16 47 65
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HHD1
n missing distinct Info Mean Gmd .05 .10
10000 0 87 0.999 35.59 14.4 11 19
.25 .50 .75 .90 .95
29 36 44 51 56
lowest : 0 1 2 3 4, highest: 83 84 87 88 96
--------------------------------------------------------------------------------
HHD2
n missing distinct Info Mean Gmd .05 .10
10000 0 94 0.999 71.63 15.37 43 53
.25 .50 .75 .90 .95
66 75 81 86 89
lowest : 0 4 7 9 10, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HHD3
n missing distinct Info Mean Gmd .05 .10
10000 0 94 1 58.95 17.75 28 37
.25 .50 .75 .90 .95
50 62 70 77 81
lowest : 0 1 4 5 6, highest: 92 93 94 95 99
--------------------------------------------------------------------------------
HHD4
n missing distinct Info Mean Gmd .05 .10
10000 0 81 0.999 27.8 13.36 7 12
.25 .50 .75 .90 .95
20 28 35 43 48
lowest : 0 1 2 3 4, highest: 78 79 80 81 91
--------------------------------------------------------------------------------
HHD5
n missing distinct Info Mean Gmd .05 .10
10000 0 91 0.998 82.07 12.63 56 68
.25 .50 .75 .90 .95
79 86 90 93 94
lowest : 0 2 5 7 8, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HHD6
n missing distinct Info Mean Gmd .05 .10
10000 0 92 0.998 17.24 11.73 5 7
.25 .50 .75 .90 .95
10 14 21 31 41
lowest : 0 1 2 3 4, highest: 92 93 95 98 99
--------------------------------------------------------------------------------
HHD7
n missing distinct Info Mean Gmd .05 .10
10000 0 49 0.993 7.793 5.134 2 3
.25 .50 .75 .90 .95
5 7 10 14 18
lowest : 0 1 2 3 4, highest: 44 45 46 47 48
--------------------------------------------------------------------------------
HHD8
n missing distinct Info Mean Gmd .05 .10
10000 0 11 0.893 1.614 1.107 0 0
.25 .50 .75 .90 .95
1 1 2 3 3
lowest : 0 1 2 3 4, highest: 6 7 8 9 10
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 1061 4095 3295 1050 307 119 43 18 8 3 1
Proportion 0.106 0.410 0.330 0.105 0.031 0.012 0.004 0.002 0.001 0.000 0.000
--------------------------------------------------------------------------------
HHD9
n missing distinct Info Mean Gmd .05 .10
10000 0 45 0.989 6.17 4.387 1 2
.25 .50 .75 .90 .95
3 5 8 11 15
lowest : 0 1 2 3 4, highest: 40 41 42 43 44
--------------------------------------------------------------------------------
HHD10
n missing distinct Info Mean Gmd .05 .10
10000 0 64 0.996 13.57 7.137 6 7
.25 .50 .75 .90 .95
9 12 16 22 27
lowest : 0 1 2 3 4, highest: 68 74 88 89 99
--------------------------------------------------------------------------------
HHD11
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.999 18.95 10.17 7 9
.25 .50 .75 .90 .95
12 18 24 31 36
lowest : 0 1 2 3 4, highest: 64 65 72 73 99
--------------------------------------------------------------------------------
HHD12
n missing distinct Info Mean Gmd .05 .10
10000 0 41 0.983 4.768 3.729 1 1
.25 .50 .75 .90 .95
2 4 6 9 12
lowest : 0 1 2 3 4, highest: 36 38 39 46 91
--------------------------------------------------------------------------------
ETHC1
n missing distinct Info Mean Gmd .05 .10
10000 0 42 0.998 16.71 7.709 3 7
.25 .50 .75 .90 .95
13 18 22 25 26
lowest : 0 1 2 3 4, highest: 37 38 39 41 42
--------------------------------------------------------------------------------
ETHC2
n missing distinct Info Mean Gmd .05 .10
10000 0 90 0.999 50.72 14.92 18 31
.25 .50 .75 .90 .95
46 54 60 64 66
lowest : 0 1 2 3 4, highest: 85 86 87 88 89
--------------------------------------------------------------------------------
ETHC3
n missing distinct Info Mean Gmd .05 .10
10000 0 96 0.999 17.51 12.44 3 5
.25 .50 .75 .90 .95
10 15 22 30 39
lowest : 0 1 2 3 4, highest: 93 94 95 96 99
--------------------------------------------------------------------------------
ETHC4
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.804 1.951 3.194 0 0
.25 .50 .75 .90 .95
0 0 1 6 12
lowest : 0 1 2 3 4, highest: 33 34 36 37 39
--------------------------------------------------------------------------------
ETHC5
n missing distinct Info Mean Gmd .05 .10
10000 0 70 0.927 4.596 7.256 0 0
.25 .50 .75 .90 .95
0 1 4 12 25
lowest : 0 1 2 3 4, highest: 65 66 67 68 70
--------------------------------------------------------------------------------
ETHC6
n missing distinct Info Mean Gmd .05 .10
10000 0 36 0.522 0.8818 1.592 0 0
.25 .50 .75 .90 .95
0 0 0 2 5
lowest : 0 1 2 3 4, highest: 31 32 34 36 54
--------------------------------------------------------------------------------
HVP1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.925 14.2 22.59 0 0
.25 .50 .75 .90 .95
0 1 11 65 88
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HVP2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.982 21.99 31.17 0 0
.25 .50 .75 .90 .95
1 4 32 88 96
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HVP3
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.998 35.99 40.28 0 1
.25 .50 .75 .90 .95
4 18 73 97 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HVP4
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.997 51.98 41.83 2 4
.25 .50 .75 .90 .95
15 50 93 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HVP5
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.976 74.45 30.99 15 26
.25 .50 .75 .90 .95
53 90 99 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HVP6
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.745 6.579 11.61 0 0
.25 .50 .75 .90 .95
0 0 2 18 50
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HUR1
n missing distinct Info Mean Gmd .05 .10
10000 0 78 0.982 4.932 6.285 0 0
.25 .50 .75 .90 .95
1 2 6 12 19
lowest : 0 1 2 3 4, highest: 81 83 88 89 90
--------------------------------------------------------------------------------
HUR2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 45.9 24.33 11 19
.25 .50 .75 .90 .95
32 44 60 76 86
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
RHP1
n missing distinct Info Mean Gmd .05 .10
10000 0 73 0.999 53.04 11.27 37 42
.25 .50 .75 .90 .95
48 52 58 66 71
lowest : 0 8 13 14 16, highest: 80 81 82 83 84
--------------------------------------------------------------------------------
RHP2
n missing distinct Info Mean Gmd .05 .10
10000 0 72 0.999 53.88 10.82 38 43
.25 .50 .75 .90 .95
49 54 59 66 70
lowest : 0 14 16 17 18, highest: 82 83 84 85 86
--------------------------------------------------------------------------------
RHP3
n missing distinct Info Mean Gmd .05 .10
10000 0 21 0.974 14.15 2.522 10 12
.25 .50 .75 .90 .95
13 14 15 17 18
lowest : 0 10 11 12 13, highest: 25 26 27 29 33
--------------------------------------------------------------------------------
RHP4
n missing distinct Info Mean Gmd .05 .10
10000 0 17 0.86 4.404 1.095 3 3
.25 .50 .75 .90 .95
4 4 5 5 6
lowest : 0 1 2 3 4, highest: 12 13 14 15 17
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 74 36 136 974 4762 3137 527 163 82 49 26
Proportion 0.007 0.004 0.014 0.097 0.476 0.314 0.053 0.016 0.008 0.005 0.003
Value 11 12 13 14 15 17
Frequency 13 8 8 1 3 1
Proportion 0.001 0.001 0.001 0.000 0.000 0.000
--------------------------------------------------------------------------------
HUPA1
n missing distinct Info Mean Gmd .05 .10
10000 0 90 0.991 10.12 12.36 0 0
.25 .50 .75 .90 .95
1 5 14 27 36
lowest : 0 1 2 3 4, highest: 86 87 89 93 95
--------------------------------------------------------------------------------
HUPA2
n missing distinct Info Mean Gmd .05 .10
10000 0 99 0.896 9.264 14.34 0 0
.25 .50 .75 .90 .95
0 1 10 31 48
lowest : 0 1 2 3 4, highest: 94 95 96 98 99
--------------------------------------------------------------------------------
HUPA3
n missing distinct Info Mean Gmd .05 .10
10000 0 97 0.869 8.517 12.72 0 0
.25 .50 .75 .90 .95
0 0 13 27 38
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HUPA4
n missing distinct Info Mean Gmd .05 .10
10000 0 67 0.998 11.15 8.039 2 3
.25 .50 .75 .90 .95
6 10 15 20 25
lowest : 0 1 2 3 4, highest: 82 85 86 88 99
--------------------------------------------------------------------------------
HUPA5
n missing distinct Info Mean Gmd .05 .10
10000 0 61 0.976 5.161 6.699 0 0
.25 .50 .75 .90 .95
0 2 7 14 20
lowest : 0 1 2 3 4, highest: 57 58 71 73 92
--------------------------------------------------------------------------------
HUPA6
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.943 10.87 15.87 0 0
.25 .50 .75 .90 .95
0 2 14 35 51
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HUPA7
n missing distinct Info Mean Gmd .05 .10
10000 0 37 0.796 1.623 2.471 0 0
.25 .50 .75 .90 .95
0 0 2 5 7
lowest : 0 1 2 3 4, highest: 42 46 50 63 71
--------------------------------------------------------------------------------
RP1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.995 29.97 35.23 0 0
.25 .50 .75 .90 .95
2 15 56 85 92
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
RP2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 42.94 39.88 0 1
.25 .50 .75 .90 .95
8 38 78 93 96
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
RP3
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 60.06 37.07 4 9
.25 .50 .75 .90 .95
30 69 91 97 98
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
RP4
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 76.84 25.18 24 38
.25 .50 .75 .90 .95
65 86 96 98 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
MSA
n missing distinct Info Mean Gmd .05 .10
9985 15 269 0.989 3569 3287 0 0
.25 .50 .75 .90 .95
520 3360 5960 7490 8280
lowest : 0 40 120 200 220, highest: 9200 9260 9270 9340 9360
--------------------------------------------------------------------------------
ADI
n missing distinct Info Mean Gmd .05 .10
9985 15 187 0.999 186.1 152.5 13 15
.25 .50 .75 .90 .95
65 175 279 373 431
lowest : 0 3 9 11 13, highest: 613 621 625 627 645
--------------------------------------------------------------------------------
DMA
n missing distinct Info Mean Gmd .05 .10
9985 15 188 0.999 664 129.6 515 524
.25 .50 .75 .90 .95
561 634 800 819 825
lowest : 0 501 503 504 505, highest: 855 862 866 868 881
--------------------------------------------------------------------------------
IC1
n missing distinct Info Mean Gmd .05 .10
10000 0 825 1 342.7 169.1 151 181
.25 .50 .75 .90 .95
233 313 420 540 628
lowest : 0 50 52 55 57, highest: 1351 1376 1388 1397 1500
--------------------------------------------------------------------------------
IC2
n missing distinct Info Mean Gmd .05 .10
10000 0 874 1 389.9 180.1 183 219
.25 .50 .75 .90 .95
275 360 469 600 692
lowest : 0 50 51 53 59, highest: 1386 1422 1453 1469 1500
--------------------------------------------------------------------------------
# Pas dans les 200 premières variables
describe(data.train[,201:300])
data.train[, 201:300]
100 Variables 10000 Observations
--------------------------------------------------------------------------------
IC3
n missing distinct Info Mean Gmd .05 .10
10000 0 830 1 384.6 169.6 197 225
.25 .50 .75 .90 .95
277 353 463 590 690
lowest : 0 25 41 62 68, highest: 1153 1157 1199 1208 1226
--------------------------------------------------------------------------------
IC4
n missing distinct Info Mean Gmd .05 .10
10000 0 877 1 428.1 181.1 225 259
.25 .50 .75 .90 .95
314 395 511 652 749
lowest : 0 71 75 80 85, highest: 1239 1246 1252 1272 1273
--------------------------------------------------------------------------------
IC5
n missing distinct Info Mean Gmd .05 .10
10000 0 7301 1 15546 7733 7462 8789
.25 .50 .75 .90 .95
10770 13626 17962 24128 29466
lowest : 0 1500 1972 2447 2582, highest: 87766 91191 94863 98040 105251
--------------------------------------------------------------------------------
IC6
n missing distinct Info Mean Gmd .05 .10
10000 0 89 0.999 21.63 16.11 3 5
.25 .50 .75 .90 .95
10 19 31 41 49
lowest : 0 1 2 3 4, highest: 84 85 96 97 99
--------------------------------------------------------------------------------
IC7
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.999 16.99 8.957 3 6
.25 .50 .75 .90 .95
11 17 22 27 30
lowest : 0 1 2 3 4, highest: 49 50 53 55 99
--------------------------------------------------------------------------------
IC8
n missing distinct Info Mean Gmd .05 .10
10000 0 46 0.997 15.78 6.99 5 8
.25 .50 .75 .90 .95
12 16 19 23 26
lowest : 0 1 2 3 4, highest: 42 44 53 64 99
--------------------------------------------------------------------------------
IC9
n missing distinct Info Mean Gmd .05 .10
10000 0 55 0.998 18.32 8.309 7 9
.25 .50 .75 .90 .95
13 18 23 28 31
lowest : 0 1 2 3 4, highest: 50 53 57 60 99
--------------------------------------------------------------------------------
IC10
n missing distinct Info Mean Gmd .05 .10
10000 0 56 0.999 15.92 10.89 3 5
.25 .50 .75 .90 .95
8 14 22 30 34
lowest : 0 1 2 3 4, highest: 51 52 55 60 61
--------------------------------------------------------------------------------
IC11
n missing distinct Info Mean Gmd .05 .10
10000 0 39 0.99 5.474 5.857 0 0
.25 .50 .75 .90 .95
1 4 8 14 17
lowest : 0 1 2 3 4, highest: 34 35 36 37 39
--------------------------------------------------------------------------------
IC12
n missing distinct Info Mean Gmd .05 .10
10000 0 28 0.922 2.164 2.958 0 0
.25 .50 .75 .90 .95
0 1 3 6 9
lowest : 0 1 2 3 4, highest: 23 24 26 27 31
--------------------------------------------------------------------------------
IC13
n missing distinct Info Mean Gmd .05 .10
10000 0 19 0.725 0.9079 1.438 0 0
.25 .50 .75 .90 .95
0 0 1 3 4
lowest : 0 1 2 3 4, highest: 14 15 16 17 21
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 6471 1515 846 427 242 152 111 69 39 45 30
Proportion 0.647 0.152 0.085 0.043 0.024 0.015 0.011 0.007 0.004 0.004 0.003
Value 11 12 13 14 15 16 17 21
Frequency 10 16 9 7 3 5 2 1
Proportion 0.001 0.002 0.001 0.001 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
IC14
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.826 1.83 2.924 0 0
.25 .50 .75 .90 .95
0 0 2 5 8
lowest : 0 1 2 3 4, highest: 52 55 56 57 59
--------------------------------------------------------------------------------
IC15
n missing distinct Info Mean Gmd .05 .10
10000 0 80 0.999 14.88 13.15 0 2
.25 .50 .75 .90 .95
5 12 21 31 39
lowest : 0 1 2 3 4, highest: 80 84 85 95 99
--------------------------------------------------------------------------------
IC16
n missing distinct Info Mean Gmd .05 .10
10000 0 66 0.999 16 10.4 2 4
.25 .50 .75 .90 .95
9 16 22 28 31
lowest : 0 1 2 3 4, highest: 64 67 71 82 99
--------------------------------------------------------------------------------
IC17
n missing distinct Info Mean Gmd .05 .10
10000 0 58 0.998 16.51 8.47 4 7
.25 .50 .75 .90 .95
12 17 21 26 29
lowest : 0 1 2 3 4, highest: 62 64 68 73 99
--------------------------------------------------------------------------------
IC18
n missing distinct Info Mean Gmd .05 .10
10000 0 62 0.999 20.6 9.393 7 10
.25 .50 .75 .90 .95
15 21 26 31 34
lowest : 0 1 2 3 4, highest: 59 60 61 66 99
--------------------------------------------------------------------------------
IC19
n missing distinct Info Mean Gmd .05 .10
10000 0 61 0.999 18.68 12.11 3 6
.25 .50 .75 .90 .95
10 17 26 33 38
lowest : 0 1 2 3 4, highest: 56 57 59 61 70
--------------------------------------------------------------------------------
IC20
n missing distinct Info Mean Gmd .05 .10
10000 0 42 0.991 6.48 6.883 0 0
.25 .50 .75 .90 .95
2 4 10 16 20
lowest : 0 1 2 3 4, highest: 37 38 39 42 55
--------------------------------------------------------------------------------
IC21
n missing distinct Info Mean Gmd .05 .10
10000 0 31 0.918 2.533 3.5 0 0
.25 .50 .75 .90 .95
0 1 3 8 11
lowest : 0 1 2 3 4, highest: 29 31 36 38 40
--------------------------------------------------------------------------------
IC22
n missing distinct Info Mean Gmd .05 .10
10000 0 21 0.729 1.091 1.737 0 0
.25 .50 .75 .90 .95
0 0 1 4 6
lowest : 0 1 2 3 4, highest: 16 17 18 19 33
--------------------------------------------------------------------------------
IC23
n missing distinct Info Mean Gmd .05 .10
10000 0 60 0.823 2.218 3.579 0 0
.25 .50 .75 .90 .95
0 0 2 6 11
lowest : 0 1 2 3 4, highest: 60 61 62 67 70
--------------------------------------------------------------------------------
HHAS1
n missing distinct Info Mean Gmd .05 .10
10000 0 95 0.999 26.58 14.85 7 11
.25 .50 .75 .90 .95
17 26 34 43 49
lowest : 0 1 2 3 4, highest: 91 92 93 95 99
--------------------------------------------------------------------------------
HHAS2
n missing distinct Info Mean Gmd .05 .10
10000 0 57 0.993 6.165 6.018 0 0
.25 .50 .75 .90 .95
2 5 8 13 18
lowest : 0 1 2 3 4, highest: 55 57 59 60 76
--------------------------------------------------------------------------------
HHAS3
n missing distinct Info Mean Gmd .05 .10
10000 0 96 1 42.65 19.28 15 21
.25 .50 .75 .90 .95
31 43 54 65 71
lowest : 0 1 2 3 4, highest: 91 92 94 96 99
--------------------------------------------------------------------------------
HHAS4
n missing distinct Info Mean Gmd .05 .10
10000 0 74 0.998 10.86 10.14 1 2
.25 .50 .75 .90 .95
4 8 15 23 30
lowest : 0 1 2 3 4, highest: 75 79 89 91 99
--------------------------------------------------------------------------------
MC1
n missing distinct Info Mean Gmd .05 .10
10000 0 93 1 48.26 17.76 25 30
.25 .50 .75 .90 .95
37 47 58 70 77
lowest : 0 5 8 9 10, highest: 94 95 96 98 99
--------------------------------------------------------------------------------
MC2
n missing distinct Info Mean Gmd .05 .10
10000 0 94 1 50.82 17.81 21 29
.25 .50 .75 .90 .95
41 53 62 69 74
lowest : 0 1 2 4 5, highest: 90 91 92 95 99
--------------------------------------------------------------------------------
MC3
n missing distinct Info Mean Gmd .05 .10
10000 0 78 0.998 12.16 10.32 1 3
.25 .50 .75 .90 .95
5 9 16 25 31
lowest : 0 1 2 3 4, highest: 84 86 88 91 99
--------------------------------------------------------------------------------
TPE1
n missing distinct Info Mean Gmd .05 .10
10000 0 84 0.999 76.05 12.83 54 63
.25 .50 .75 .90 .95
72 79 84 88 91
lowest : 0 6 11 16 17, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
TPE2
n missing distinct Info Mean Gmd .05 .10
10000 0 50 0.997 13 7.268 3 5
.25 .50 .75 .90 .95
9 12 17 21 25
lowest : 0 1 2 3 4, highest: 45 46 47 49 56
--------------------------------------------------------------------------------
TPE3
n missing distinct Info Mean Gmd .05 .10
10000 0 57 0.848 2.34 3.726 0 0
.25 .50 .75 .90 .95
0 0 2 6 12
lowest : 0 1 2 3 4, highest: 53 55 58 60 62
--------------------------------------------------------------------------------
TPE4
n missing distinct Info Mean Gmd .05 .10
10000 0 52 0.797 1.81 2.953 0 0
.25 .50 .75 .90 .95
0 0 2 5 9
lowest : 0 1 2 3 4, highest: 50 51 55 58 62
--------------------------------------------------------------------------------
TPE5
n missing distinct Info Mean Gmd .05 .10
10000 0 31 0.224 0.4137 0.7936 0 0
.25 .50 .75 .90 .95
0 0 0 0 2
lowest : 0 1 2 3 4, highest: 29 32 36 44 50
--------------------------------------------------------------------------------
TPE6
n missing distinct Info Mean Gmd .05 .10
10000 0 18 0.156 0.1101 0.2132 0 0
.25 .50 .75 .90 .95
0 0 0 0 1
lowest : 0 1 2 3 4, highest: 21 26 27 29 30
Value 0 1 2 3 4 5 6 7 8 9 11
Frequency 9450 359 100 44 18 7 2 3 4 2 3
Proportion 0.945 0.036 0.010 0.004 0.002 0.001 0.000 0.000 0.000 0.000 0.000
Value 13 14 21 26 27 29 30
Frequency 1 1 1 1 2 1 1
Proportion 0.000 0.000 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
TPE7
n missing distinct Info Mean Gmd .05 .10
10000 0 14 0.411 0.242 0.4258 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 10 11 12 16 22
Value 0 1 2 3 4 5 6 7 8 10 11
Frequency 8373 1140 333 92 33 15 4 4 1 1 1
Proportion 0.837 0.114 0.033 0.009 0.003 0.002 0.000 0.000 0.000 0.000 0.000
Value 12 16 22
Frequency 1 1 1
Proportion 0.000 0.000 0.000
--------------------------------------------------------------------------------
TPE8
n missing distinct Info Mean Gmd .05 .10
10000 0 60 0.984 4.128 4.588 0 0
.25 .50 .75 .90 .95
1 3 5 9 13
lowest : 0 1 2 3 4, highest: 62 65 68 69 99
--------------------------------------------------------------------------------
TPE9
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.98 3.268 3.34 0 0
.25 .50 .75 .90 .95
1 2 4 7 10
lowest : 0 1 2 3 4, highest: 34 35 36 40 55
--------------------------------------------------------------------------------
PEC1
n missing distinct Info Mean Gmd .05 .10
10000 0 69 0.883 2.311 3.675 0 0
.25 .50 .75 .90 .95
0 1 2 4 10
lowest : 0 1 2 3 4, highest: 67 68 72 75 99
--------------------------------------------------------------------------------
PEC2
n missing distinct Info Mean Gmd .05 .10
10000 0 97 0.999 18.36 19.97 0 1
.25 .50 .75 .90 .95
4 11 27 48 60
lowest : 0 1 2 3 4, highest: 92 94 95 96 99
--------------------------------------------------------------------------------
TPE10
n missing distinct Info Mean Gmd .05 .10
10000 0 52 0.997 19.39 7.059 10 12
.25 .50 .75 .90 .95
15 19 23 28 31
lowest : 0 3 4 5 6, highest: 51 52 53 59 90
--------------------------------------------------------------------------------
TPE11
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.998 23.63 7.265 14 16
.25 .50 .75 .90 .95
19 23 28 32 35
lowest : 0 3 5 7 8, highest: 52 53 54 57 64
--------------------------------------------------------------------------------
TPE12
n missing distinct Info Mean Gmd .05 .10
10000 0 45 0.991 5.242 5.177 0 0
.25 .50 .75 .90 .95
2 4 7 12 15
lowest : 0 1 2 3 4, highest: 41 43 44 47 50
--------------------------------------------------------------------------------
TPE13
n missing distinct Info Mean Gmd .05 .10
10000 0 97 1 59.89 19.25 26 35
.25 .50 .75 .90 .95
50 63 73 79 82
lowest : 0 3 4 5 6, highest: 94 95 96 97 99
--------------------------------------------------------------------------------
LFC1
n missing distinct Info Mean Gmd .05 .10
10000 0 96 0.999 64.47 14.01 42 50
.25 .50 .75 .90 .95
59 66 73 79 82
lowest : 0 2 5 6 7, highest: 94 96 97 98 99
--------------------------------------------------------------------------------
LFC2
n missing distinct Info Mean Gmd .05 .10
10000 0 97 0.999 73.82 14.65 49 59
.25 .50 .75 .90 .95
68 76 83 88 91
lowest : 0 2 4 6 7, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LFC3
n missing distinct Info Mean Gmd .05 .10
10000 0 96 0.999 55.94 14.74 33 40
.25 .50 .75 .90 .95
49 57 65 71 76
lowest : 0 2 4 5 6, highest: 93 94 95 96 99
--------------------------------------------------------------------------------
LFC4
n missing distinct Info Mean Gmd .05 .10
10000 0 98 0.999 69.79 15.43 43.95 53.00
.25 .50 .75 .90 .95
63.00 72.00 79.00 85.00 88.00
lowest : 0 2 4 5 6, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LFC5
n missing distinct Info Mean Gmd .05 .10
10000 0 95 0.999 52.83 14.91 30 37
.25 .50 .75 .90 .95
45 54 62 69 73
lowest : 0 2 4 5 6, highest: 92 93 94 96 99
--------------------------------------------------------------------------------
LFC6
n missing distinct Info Mean Gmd .05 .10
10000 0 94 0.999 63.99 17.21 36.0 46.0
.25 .50 .75 .90 .95
57.0 66.0 74.0 81.1 86.0
lowest : 0 4 7 8 10, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LFC7
n missing distinct Info Mean Gmd .05 .10
10000 0 98 1 48.79 19.02 17 27
.25 .50 .75 .90 .95
40 50 60 69 75
lowest : 0 1 2 3 4, highest: 93 94 95 96 99
--------------------------------------------------------------------------------
LFC8
n missing distinct Info Mean Gmd .05 .10
10000 0 95 0.974 69.93 33 0 8
.25 .50 .75 .90 .95
55 78 99 99 99
lowest : 0 3 7 8 9, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LFC9
n missing distinct Info Mean Gmd .05 .10
10000 0 91 0.848 62.91 45.3 0 0
.25 .50 .75 .90 .95
0 97 99 99 99
lowest : 0 6 7 11 13, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LFC10
n missing distinct Info Mean Gmd .05 .10
10000 0 78 0.972 6.568 8.268 0 0
.25 .50 .75 .90 .95
0 4 9 17 24
lowest : 0 1 2 3 4, highest: 80 82 84 95 99
--------------------------------------------------------------------------------
OCC1
n missing distinct Info Mean Gmd .05 .10
10000 0 59 0.998 14.07 8.897 3 5
.25 .50 .75 .90 .95
8 13 18 25 30
lowest : 0 1 2 3 4, highest: 55 58 59 61 99
--------------------------------------------------------------------------------
OCC2
n missing distinct Info Mean Gmd .05 .10
10000 0 47 0.998 12.48 7.738 3 5
.25 .50 .75 .90 .95
8 11 17 22 26
lowest : 0 1 2 3 4, highest: 42 43 44 46 68
--------------------------------------------------------------------------------
OCC3
n missing distinct Info Mean Gmd .05 .10
10000 0 25 0.981 3.567 2.693 0 0
.25 .50 .75 .90 .95
2 3 5 7 8
lowest : 0 1 2 3 4, highest: 21 22 30 33 99
--------------------------------------------------------------------------------
OCC4
n missing distinct Info Mean Gmd .05 .10
10000 0 49 0.996 12.31 6.056 4 6
.25 .50 .75 .90 .95
9 12 15 19 22
lowest : 0 1 2 3 4, highest: 44 49 50 51 55
--------------------------------------------------------------------------------
OCC5
n missing distinct Info Mean Gmd .05 .10
10000 0 48 0.997 15.48 6.239 7 9
.25 .50 .75 .90 .95
12 15 19 22 25
lowest : 0 1 2 3 4, highest: 43 50 52 63 68
--------------------------------------------------------------------------------
OCC6
n missing distinct Info Mean Gmd .05 .10
10000 0 18 0.592 0.4224 0.6911 0 0
.25 .50 .75 .90 .95
0 0 1 1 2
lowest : 0 1 2 3 4, highest: 13 14 15 18 20
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 7381 1801 469 190 71 27 29 10 5 4 2
Proportion 0.738 0.180 0.047 0.019 0.007 0.003 0.003 0.001 0.000 0.000 0.000
Value 11 12 13 14 15 18 20
Frequency 3 1 2 1 1 2 1
Proportion 0.000 0.000 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
OCC7
n missing distinct Info Mean Gmd .05 .10
10000 0 20 0.945 1.657 1.776 0 0
.25 .50 .75 .90 .95
0 1 2 4 5
lowest : 0 1 2 3 4, highest: 15 19 21 24 27
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 2911 2685 2115 1095 568 294 148 61 33 27 15
Proportion 0.291 0.268 0.212 0.110 0.057 0.029 0.015 0.006 0.003 0.003 0.002
Value 11 12 13 14 15 19 21 24 27
Frequency 14 6 7 12 5 1 1 1 1
Proportion 0.001 0.001 0.001 0.001 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
OCC8
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.996 10.64 6.399 3 4
.25 .50 .75 .90 .95
7 10 13 18 22
lowest : 0 1 2 3 4, highest: 49 50 52 53 74
--------------------------------------------------------------------------------
OCC9
n missing distinct Info Mean Gmd .05 .10
10000 0 49 0.944 2.832 3.925 0 0
.25 .50 .75 .90 .95
0 1 3 7 12
lowest : 0 1 2 3 4, highest: 47 49 50 52 99
--------------------------------------------------------------------------------
OCC10
n missing distinct Info Mean Gmd .05 .10
10000 0 46 0.997 11.52 6.623 2 4
.25 .50 .75 .90 .95
7 11 15 19 21
lowest : 0 1 2 3 4, highest: 46 49 63 65 99
--------------------------------------------------------------------------------
OCC11
n missing distinct Info Mean Gmd .05 .10
10000 0 44 0.994 6.3 6.059 0 0
.25 .50 .75 .90 .95
2 5 9 14 18
lowest : 0 1 2 3 4, highest: 42 48 50 54 61
--------------------------------------------------------------------------------
OCC12
n missing distinct Info Mean Gmd .05 .10
10000 0 29 0.988 4.077 3.462 0 0
.25 .50 .75 .90 .95
2 4 6 8 10
lowest : 0 1 2 3 4, highest: 27 31 48 50 55
--------------------------------------------------------------------------------
OCC13
n missing distinct Info Mean Gmd .05 .10
10000 0 29 0.985 3.679 3.108 0 0
.25 .50 .75 .90 .95
2 3 5 7 9
lowest : 0 1 2 3 4, highest: 24 27 30 36 38
--------------------------------------------------------------------------------
EIC1
n missing distinct Info Mean Gmd .05 .10
10000 0 52 0.955 3.1 4.219 0 0
.25 .50 .75 .90 .95
0 1 4 8 13
lowest : 0 1 2 3 4, highest: 50 51 54 55 99
--------------------------------------------------------------------------------
EIC2
n missing distinct Info Mean Gmd .05 .10
10000 0 35 0.537 0.7198 1.278 0 0
.25 .50 .75 .90 .95
0 0 0 2 4
lowest : 0 1 2 3 4, highest: 34 37 38 39 56
--------------------------------------------------------------------------------
EIC3
n missing distinct Info Mean Gmd .05 .10
10000 0 34 0.993 6.449 4.388 0 2
.25 .50 .75 .90 .95
4 6 9 11 14
lowest : 0 1 2 3 4, highest: 29 30 36 38 50
--------------------------------------------------------------------------------
EIC4
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.999 17 11.29 3 5
.25 .50 .75 .90 .95
9 15 23 31 36
lowest : 0 1 2 3 4, highest: 64 65 66 72 99
--------------------------------------------------------------------------------
EIC5
n missing distinct Info Mean Gmd .05 .10
10000 0 32 0.986 4.351 3.357 0 1
.25 .50 .75 .90 .95
2 4 6 8 10
lowest : 0 1 2 3 4, highest: 29 30 32 35 99
--------------------------------------------------------------------------------
EIC6
n missing distinct Info Mean Gmd .05 .10
10000 0 23 0.974 2.707 2.394 0 0
.25 .50 .75 .90 .95
1 2 4 6 7
lowest : 0 1 2 3 4, highest: 18 19 21 24 38
--------------------------------------------------------------------------------
EIC7
n missing distinct Info Mean Gmd .05 .10
10000 0 27 0.987 4.343 3.247 0 1
.25 .50 .75 .90 .95
2 4 6 8 10
lowest : 0 1 2 3 4, highest: 23 27 32 36 48
--------------------------------------------------------------------------------
EIC8
n missing distinct Info Mean Gmd .05 .10
10000 0 57 0.997 16.92 6.69 8 10
.25 .50 .75 .90 .95
13 16 20 24 27
lowest : 0 1 2 3 4, highest: 65 68 72 85 99
--------------------------------------------------------------------------------
EIC9
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.994 6.806 4.946 0 2
.25 .50 .75 .90 .95
4 6 9 13 15
lowest : 0 1 2 3 4, highest: 33 34 35 41 59
--------------------------------------------------------------------------------
EIC10
n missing distinct Info Mean Gmd .05 .10
10000 0 26 0.986 4.639 3.143 0 1
.25 .50 .75 .90 .95
3 4 6 8 10
lowest : 0 1 2 3 4, highest: 21 23 24 28 78
--------------------------------------------------------------------------------
EIC11
n missing distinct Info Mean Gmd .05 .10
10000 0 39 0.976 3.313 2.971 0 0
.25 .50 .75 .90 .95
2 3 4 6 9
lowest : 0 1 2 3 4, highest: 41 45 51 60 99
--------------------------------------------------------------------------------
EIC12
n missing distinct Info Mean Gmd .05 .10
10000 0 32 0.927 1.557 1.901 0 0
.25 .50 .75 .90 .95
0 1 2 4 5
lowest : 0 1 2 3 4, highest: 30 32 33 50 53
--------------------------------------------------------------------------------
EIC13
n missing distinct Info Mean Gmd .05 .10
10000 0 41 0.993 8.057 4.698 2.0 3.0
.25 .50 .75 .90 .95
5.0 7.5 10.0 13.0 16.0
lowest : 0 1 2 3 4, highest: 36 41 42 46 60
--------------------------------------------------------------------------------
EIC14
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.994 8.049 5.41 1 3
.25 .50 .75 .90 .95
5 7 10 14 17
lowest : 0 1 2 3 4, highest: 55 56 61 66 69
--------------------------------------------------------------------------------
EIC15
n missing distinct Info Mean Gmd .05 .10
10000 0 42 0.992 6.421 4.495 1 2
.25 .50 .75 .90 .95
4 6 8 12 15
lowest : 0 1 2 3 4, highest: 45 48 50 52 99
--------------------------------------------------------------------------------
EIC16
n missing distinct Info Mean Gmd .05 .10
10000 0 39 0.988 4.614 3.995 0 1
.25 .50 .75 .90 .95
2 4 6 9 12
lowest : 0 1 2 3 4, highest: 35 36 37 39 54
--------------------------------------------------------------------------------
OEDC1
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.993 7.039 4.506 1 2
.25 .50 .75 .90 .95
4 6 9 12 15
lowest : 0 1 2 3 4, highest: 37 38 43 44 57
--------------------------------------------------------------------------------
OEDC2
n missing distinct Info Mean Gmd .05 .10
10000 0 49 0.988 4.693 4.766 0 0
.25 .50 .75 .90 .95
2 3 6 10 14
lowest : 0 1 2 3 4, highest: 45 46 48 52 61
--------------------------------------------------------------------------------
OEDC3
n missing distinct Info Mean Gmd .05 .10
10000 0 45 0.976 3.232 3.599 0 0
.25 .50 .75 .90 .95
1 2 4 7 10
lowest : 0 1 2 3 4, highest: 41 44 46 49 50
--------------------------------------------------------------------------------
OEDC4
n missing distinct Info Mean Gmd .05 .10
10000 0 46 0.995 8.013 5.691 1 2
.25 .50 .75 .90 .95
4 7 10 15 18
lowest : 0 1 2 3 4, highest: 41 42 43 44 99
--------------------------------------------------------------------------------
OEDC5
n missing distinct Info Mean Gmd .05 .10
10000 0 77 0.999 69.45 12.64 50 56
.25 .50 .75 .90 .95
64 71 77 82 85
lowest : 0 19 21 24 26, highest: 94 95 96 97 99
--------------------------------------------------------------------------------
OEDC6
n missing distinct Info Mean Gmd .05 .10
10000 0 43 0.991 6.085 4.313 0 2
.25 .50 .75 .90 .95
3 5 8 11 13
lowest : 0 1 2 3 4, highest: 46 55 58 74 99
--------------------------------------------------------------------------------
OEDC7
n missing distinct Info Mean Gmd .05 .10
10000 0 15 0.672 0.5065 0.7859 0 0
.25 .50 .75 .90 .95
0 0 1 2 2
lowest : 0 1 2 3 4, highest: 10 11 12 13 49
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 6826 2130 651 226 75 35 19 10 14 4 4
Proportion 0.683 0.213 0.065 0.023 0.007 0.004 0.002 0.001 0.001 0.000 0.000
Value 11 12 13 49
Frequency 2 2 1 1
Proportion 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
EC1
n missing distinct Info Mean Gmd .05 .10
10000 0 79 0.874 127.9 15.78 120 120
.25 .50 .75 .90 .95
120 120 140 151 160
lowest : 0 89 90 91 92, highest: 162 163 164 166 170
--------------------------------------------------------------------------------
EC2
n missing distinct Info Mean Gmd .05 .10
10000 0 63 0.997 8.786 8.451 0 1
.25 .50 .75 .90 .95
3 6 12 20 25
lowest : 0 1 2 3 4, highest: 61 62 67 68 78
--------------------------------------------------------------------------------
EC3
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.998 12.95 8.617 2 4
.25 .50 .75 .90 .95
7 12 18 23 26
lowest : 0 1 2 3 4, highest: 55 58 62 82 99
--------------------------------------------------------------------------------
EC4
n missing distinct Info Mean Gmd .05 .10
10000 0 65 0.999 28.74 11.56 11 15
.25 .50 .75 .90 .95
22 29 36 41 44
lowest : 0 1 2 3 4, highest: 60 61 63 71 99
--------------------------------------------------------------------------------
EC5
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.998 20.77 7.97 10 12
.25 .50 .75 .90 .95
16 21 25 30 32
lowest : 0 1 2 3 4, highest: 50 51 52 60 61
--------------------------------------------------------------------------------
EC6
n missing distinct Info Mean Gmd .05 .10
10000 0 25 0.991 6.522 3.706 2 2
.25 .50 .75 .90 .95
4 6 9 11 12
lowest : 0 1 2 3 4, highest: 20 21 22 23 30
--------------------------------------------------------------------------------
EC7
n missing distinct Info Mean Gmd .05 .10
10000 0 55 0.999 13.97 10.72 2 4
.25 .50 .75 .90 .95
6 12 20 28 33
lowest : 0 1 2 3 4, highest: 50 53 55 58 99
--------------------------------------------------------------------------------
EC8
n missing distinct Info Mean Gmd .05 .10
10000 0 56 0.995 7.33 7.086 0 1
.25 .50 .75 .90 .95
3 5 10 17 22
lowest : 0 1 2 3 4, highest: 58 59 67 75 99
--------------------------------------------------------------------------------
SEC1
n missing distinct Info Mean Gmd .05 .10
10000 0 44 0.983 3.73 3.397 0 1
.25 .50 .75 .90 .95
1 3 5 7 10
lowest : 0 1 2 3 4, highest: 59 63 68 73 74
--------------------------------------------------------------------------------
SEC2
n missing distinct Info Mean Gmd .05 .10
10000 0 76 0.998 22.22 8.287 9 13
.25 .50 .75 .90 .95
18 23 26 31 33
lowest : 0 1 2 3 4, highest: 84 85 90 95 99
--------------------------------------------------------------------------------
SEC3
n missing distinct Info Mean Gmd .05 .10
10000 0 15 0.94 1.894 1.453 0 0
.25 .50 .75 .90 .95
1 2 3 4 4
lowest : 0 1 2 3 4, highest: 10 11 12 13 19
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 1349 2886 3071 1576 684 286 84 38 11 7 2
Proportion 0.135 0.289 0.307 0.158 0.068 0.029 0.008 0.004 0.001 0.001 0.000
Value 11 12 13 19
Frequency 1 3 1 1
Proportion 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
describe(data.val[,201:300])
data.val[, 201:300]
100 Variables 10000 Observations
--------------------------------------------------------------------------------
IC3
n missing distinct Info Mean Gmd .05 .10
10000 0 842 1 390.1 171.7 199 229
.25 .50 .75 .90 .95
280 358 472 599 700
lowest : 0 66 71 72 74, highest: 1187 1188 1193 1201 1394
--------------------------------------------------------------------------------
IC4
n missing distinct Info Mean Gmd .05 .10
10000 0 892 1 433.7 184 227 261
.25 .50 .75 .90 .95
316 402 518 658 765
lowest : 0 57 73 77 82, highest: 1242 1243 1259 1394 1500
--------------------------------------------------------------------------------
IC5
n missing distinct Info Mean Gmd .05 .10
10000 0 7411 1 15858 8028 7507 8755
.25 .50 .75 .90 .95
10831 13912 18450 24831 30452
lowest : 0 1972 2909 2970 3030, highest: 100528 112390 119336 119454 159306
--------------------------------------------------------------------------------
IC6
n missing distinct Info Mean Gmd .05 .10
10000 0 91 0.999 21.33 16.08 3 5
.25 .50 .75 .90 .95
10 19 30 41 48
lowest : 0 1 2 3 4, highest: 89 90 94 98 99
--------------------------------------------------------------------------------
IC7
n missing distinct Info Mean Gmd .05 .10
10000 0 55 0.998 16.79 8.87 4 6
.25 .50 .75 .90 .95
11 17 22 26 29
lowest : 0 1 2 3 4, highest: 54 55 57 58 99
--------------------------------------------------------------------------------
IC8
n missing distinct Info Mean Gmd .05 .10
10000 0 47 0.997 15.71 6.821 5 8
.25 .50 .75 .90 .95
12 16 19 23 26
lowest : 0 1 2 3 4, highest: 43 44 45 61 99
--------------------------------------------------------------------------------
IC9
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.998 18.44 8.237 6 9
.25 .50 .75 .90 .95
14 18 23 28 31
lowest : 0 1 2 3 4, highest: 50 52 53 67 99
--------------------------------------------------------------------------------
IC10
n missing distinct Info Mean Gmd .05 .10
10000 0 56 0.999 16.14 10.84 3 5
.25 .50 .75 .90 .95
9 15 23 30 33
lowest : 0 1 2 3 4, highest: 51 52 53 54 99
--------------------------------------------------------------------------------
IC11
n missing distinct Info Mean Gmd .05 .10
10000 0 40 0.991 5.673 6.061 0 0
.25 .50 .75 .90 .95
1 4 8 14 18
lowest : 0 1 2 3 4, highest: 35 36 37 39 47
--------------------------------------------------------------------------------
IC12
n missing distinct Info Mean Gmd .05 .10
10000 0 32 0.926 2.269 3.083 0 0
.25 .50 .75 .90 .95
0 1 3 7 9
lowest : 0 1 2 3 4, highest: 28 31 32 33 50
--------------------------------------------------------------------------------
IC13
n missing distinct Info Mean Gmd .05 .10
10000 0 21 0.731 0.9277 1.464 0 0
.25 .50 .75 .90 .95
0 0 1 3 4
lowest : 0 1 2 3 4, highest: 16 17 18 20 28
--------------------------------------------------------------------------------
IC14
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.835 1.897 3.014 0 0
.25 .50 .75 .90 .95
0 0 2 5 9
lowest : 0 1 2 3 4, highest: 54 58 59 60 70
--------------------------------------------------------------------------------
IC15
n missing distinct Info Mean Gmd .05 .10
10000 0 87 0.999 14.73 13.28 0 2
.25 .50 .75 .90 .95
5 12 21 31 39
lowest : 0 1 2 3 4, highest: 86 87 88 92 99
--------------------------------------------------------------------------------
IC16
n missing distinct Info Mean Gmd .05 .10
10000 0 61 0.999 15.79 10.22 2 4
.25 .50 .75 .90 .95
9 16 22 27 31
lowest : 0 1 2 3 4, highest: 58 60 67 68 99
--------------------------------------------------------------------------------
IC17
n missing distinct Info Mean Gmd .05 .10
10000 0 55 0.998 16.31 8.315 4 6
.25 .50 .75 .90 .95
11 17 21 25 28
lowest : 0 1 2 3 4, highest: 52 53 59 62 99
--------------------------------------------------------------------------------
IC18
n missing distinct Info Mean Gmd .05 .10
10000 0 60 0.999 20.71 9.33 7 10
.25 .50 .75 .90 .95
15 21 26 31 34
lowest : 0 1 2 3 4, highest: 56 57 62 67 99
--------------------------------------------------------------------------------
IC19
n missing distinct Info Mean Gmd .05 .10
10000 0 60 0.999 18.86 11.96 3 6
.25 .50 .75 .90 .95
11 18 26 33 37
lowest : 0 1 2 3 4, highest: 55 56 58 59 68
--------------------------------------------------------------------------------
IC20
n missing distinct Info Mean Gmd .05 .10
10000 0 45 0.992 6.697 7.096 0 0
.25 .50 .75 .90 .95
2 4 10 17 21
lowest : 0 1 2 3 4, highest: 40 41 43 49 64
--------------------------------------------------------------------------------
IC21
n missing distinct Info Mean Gmd .05 .10
10000 0 34 0.923 2.643 3.627 0 0
.25 .50 .75 .90 .95
0 1 4 8 11
lowest : 0 1 2 3 4, highest: 29 30 32 40 50
--------------------------------------------------------------------------------
IC22
n missing distinct Info Mean Gmd .05 .10
10000 0 23 0.735 1.124 1.779 0 0
.25 .50 .75 .90 .95
0 0 1 4 6
lowest : 0 1 2 3 4, highest: 18 19 20 24 30
--------------------------------------------------------------------------------
IC23
n missing distinct Info Mean Gmd .05 .10
10000 0 62 0.831 2.318 3.727 0 0
.25 .50 .75 .90 .95
0 0 2 6 11
lowest : 0 1 2 3 4, highest: 63 65 66 70 99
--------------------------------------------------------------------------------
HHAS1
n missing distinct Info Mean Gmd .05 .10
10000 0 94 0.999 26.96 15.21 7 10
.25 .50 .75 .90 .95
17 26 35 43 51
lowest : 0 1 2 3 4, highest: 91 92 93 95 99
--------------------------------------------------------------------------------
HHAS2
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.993 6.069 5.979 0 0
.25 .50 .75 .90 .95
2 4 8 13 18
lowest : 0 1 2 3 4, highest: 53 54 58 62 80
--------------------------------------------------------------------------------
HHAS3
n missing distinct Info Mean Gmd .05 .10
10000 0 95 1 43.21 19.35 16 22
.25 .50 .75 .90 .95
31 43 55 66 72
lowest : 0 1 2 3 4, highest: 91 92 93 94 99
--------------------------------------------------------------------------------
HHAS4
n missing distinct Info Mean Gmd .05 .10
10000 0 72 0.998 10.57 10.02 1 1
.25 .50 .75 .90 .95
4 8 14 23 30
lowest : 0 1 2 3 4, highest: 72 76 77 80 91
--------------------------------------------------------------------------------
MC1
n missing distinct Info Mean Gmd .05 .10
10000 0 94 1 48.15 17.63 26 30
.25 .50 .75 .90 .95
37 46 58 69 77
lowest : 0 6 8 9 10, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
MC2
n missing distinct Info Mean Gmd .05 .10
10000 0 95 1 51.08 17.68 22 30
.25 .50 .75 .90 .95
42 53 62 70 73
lowest : 0 1 2 3 4, highest: 90 91 92 94 99
--------------------------------------------------------------------------------
MC3
n missing distinct Info Mean Gmd .05 .10
10000 0 75 0.998 12.23 10.42 1 2
.25 .50 .75 .90 .95
5 9 16 25 32
lowest : 0 1 2 3 4, highest: 79 81 82 88 96
--------------------------------------------------------------------------------
TPE1
n missing distinct Info Mean Gmd .05 .10
10000 0 90 0.999 76.09 12.93 53 63
.25 .50 .75 .90 .95
72 79 84 88 91
lowest : 0 3 6 8 9, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
TPE2
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.997 13.02 7.228 4 5
.25 .50 .75 .90 .95
9 12 17 21 25
lowest : 0 1 2 3 4, highest: 49 52 58 61 66
--------------------------------------------------------------------------------
TPE3
n missing distinct Info Mean Gmd .05 .10
10000 0 57 0.851 2.51 4.017 0 0
.25 .50 .75 .90 .95
0 0 2 7 13
lowest : 0 1 2 3 4, highest: 54 55 61 62 71
--------------------------------------------------------------------------------
TPE4
n missing distinct Info Mean Gmd .05 .10
10000 0 52 0.796 1.945 3.191 0 0
.25 .50 .75 .90 .95
0 0 2 5 10
lowest : 0 1 2 3 4, highest: 48 52 55 61 71
--------------------------------------------------------------------------------
TPE5
n missing distinct Info Mean Gmd .05 .10
10000 0 31 0.237 0.4527 0.8653 0 0
.25 .50 .75 .90 .95
0 0 0 0 3
lowest : 0 1 2 3 4, highest: 26 32 33 35 38
--------------------------------------------------------------------------------
TPE6
n missing distinct Info Mean Gmd .05 .10
10000 0 15 0.15 0.1017 0.1967 0 0
.25 .50 .75 .90 .95
0 0 0 0 1
lowest : 0 1 2 3 4, highest: 11 12 13 15 17
Value 0 1 2 3 4 5 6 7 8 9 11
Frequency 9473 321 115 39 20 8 7 3 2 3 1
Proportion 0.947 0.032 0.012 0.004 0.002 0.001 0.001 0.000 0.000 0.000 0.000
Value 12 13 15 17
Frequency 2 3 1 2
Proportion 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
TPE7
n missing distinct Info Mean Gmd .05 .10
10000 0 10 0.402 0.2358 0.4162 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 5 6 7 9 12
Value 0 1 2 3 4 5 6 7 9 12
Frequency 8418 1098 316 99 43 13 4 7 1 1
Proportion 0.842 0.110 0.032 0.010 0.004 0.001 0.000 0.001 0.000 0.000
--------------------------------------------------------------------------------
TPE8
n missing distinct Info Mean Gmd .05 .10
10000 0 67 0.983 4.041 4.519 0 0
.25 .50 .75 .90 .95
1 3 5 9 12
lowest : 0 1 2 3 4, highest: 67 69 71 91 99
--------------------------------------------------------------------------------
TPE9
n missing distinct Info Mean Gmd .05 .10
10000 0 37 0.98 3.304 3.34 0 0
.25 .50 .75 .90 .95
1 2 4 7 9
lowest : 0 1 2 3 4, highest: 35 39 43 83 99
--------------------------------------------------------------------------------
PEC1
n missing distinct Info Mean Gmd .05 .10
10000 0 69 0.883 2.14 3.366 0 0
.25 .50 .75 .90 .95
0 1 2 4 9
lowest : 0 1 2 3 4, highest: 66 67 70 72 79
--------------------------------------------------------------------------------
PEC2
n missing distinct Info Mean Gmd .05 .10
10000 0 96 0.999 18.38 19.73 0 1
.25 .50 .75 .90 .95
4 11 27 47 60
lowest : 0 1 2 3 4, highest: 92 93 94 96 99
--------------------------------------------------------------------------------
TPE10
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.997 19.52 6.958 10 12
.25 .50 .75 .90 .95
15 19 23 28 31
lowest : 0 2 3 4 5, highest: 51 52 53 59 90
--------------------------------------------------------------------------------
TPE11
n missing distinct Info Mean Gmd .05 .10
10000 0 52 0.997 23.81 7.124 14 16
.25 .50 .75 .90 .95
20 23 28 32 35
lowest : 0 3 5 6 7, highest: 50 51 52 54 60
--------------------------------------------------------------------------------
TPE12
n missing distinct Info Mean Gmd .05 .10
10000 0 44 0.991 5.296 5.086 0 0
.25 .50 .75 .90 .95
2 4 7 12 15
lowest : 0 1 2 3 4, highest: 39 44 47 48 51
--------------------------------------------------------------------------------
TPE13
n missing distinct Info Mean Gmd .05 .10
10000 0 93 1 60.45 18.95 27 36
.25 .50 .75 .90 .95
51 64 73 79 83
lowest : 0 5 7 8 9, highest: 93 94 95 96 99
--------------------------------------------------------------------------------
LFC1
n missing distinct Info Mean Gmd .05 .10
10000 0 96 0.999 64.57 14.07 41.95 50.00
.25 .50 .75 .90 .95
59.00 66.00 73.00 79.00 82.00
lowest : 0 2 4 5 6, highest: 93 94 95 96 99
--------------------------------------------------------------------------------
LFC2
n missing distinct Info Mean Gmd .05 .10
10000 0 97 0.999 73.89 14.6 49 58
.25 .50 .75 .90 .95
69 76 83 88 91
lowest : 0 3 4 5 6, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LFC3
n missing distinct Info Mean Gmd .05 .10
10000 0 92 0.999 56.03 14.85 33 40
.25 .50 .75 .90 .95
49 57 65 72 76
lowest : 0 4 5 6 7, highest: 90 92 93 95 99
--------------------------------------------------------------------------------
LFC4
n missing distinct Info Mean Gmd .05 .10
10000 0 98 0.999 69.89 15.38 43 53
.25 .50 .75 .90 .95
64 72 79 85 88
lowest : 0 2 3 4 5, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LFC5
n missing distinct Info Mean Gmd .05 .10
10000 0 93 0.999 53.02 15.06 29.95 37.00
.25 .50 .75 .90 .95
46.00 54.00 62.00 69.00 73.00
lowest : 0 2 4 5 6, highest: 90 91 92 94 99
--------------------------------------------------------------------------------
LFC6
n missing distinct Info Mean Gmd .05 .10
10000 0 91 0.999 64.19 16.99 37 46
.25 .50 .75 .90 .95
57 66 74 81 86
lowest : 0 8 11 12 13, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LFC7
n missing distinct Info Mean Gmd .05 .10
10000 0 98 1 48.88 18.79 17 27
.25 .50 .75 .90 .95
40 50 60 68 74
lowest : 0 1 2 3 4, highest: 93 94 95 96 99
--------------------------------------------------------------------------------
LFC8
n missing distinct Info Mean Gmd .05 .10
10000 0 98 0.971 70.13 33 0 11
.25 .50 .75 .90 .95
56 78 99 99 99
lowest : 0 3 4 5 6, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LFC9
n missing distinct Info Mean Gmd .05 .10
10000 0 91 0.848 62.85 45.32 0 0
.25 .50 .75 .90 .95
0 97 99 99 99
lowest : 0 4 6 11 13, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LFC10
n missing distinct Info Mean Gmd .05 .10
10000 0 76 0.972 6.672 8.392 0 0
.25 .50 .75 .90 .95
0 4 9 17 24
lowest : 0 1 2 3 4, highest: 80 82 90 96 99
--------------------------------------------------------------------------------
OCC1
n missing distinct Info Mean Gmd .05 .10
10000 0 61 0.998 14.3 9.052 3 5
.25 .50 .75 .90 .95
9 13 19 26 30
lowest : 0 1 2 3 4, highest: 57 58 62 78 99
--------------------------------------------------------------------------------
OCC2
n missing distinct Info Mean Gmd .05 .10
10000 0 50 0.998 12.69 7.753 3 5
.25 .50 .75 .90 .95
8 11 17 22 26
lowest : 0 1 2 3 4, highest: 46 50 51 59 60
--------------------------------------------------------------------------------
OCC3
n missing distinct Info Mean Gmd .05 .10
10000 0 25 0.98 3.582 2.703 0 0
.25 .50 .75 .90 .95
2 3 5 7 8
lowest : 0 1 2 3 4, highest: 24 30 33 34 99
--------------------------------------------------------------------------------
OCC4
n missing distinct Info Mean Gmd .05 .10
10000 0 48 0.996 12.44 6.039 4 6
.25 .50 .75 .90 .95
9 12 15 19 22
lowest : 0 1 2 3 4, highest: 56 60 62 66 99
--------------------------------------------------------------------------------
OCC5
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.996 15.59 6.23 7 9
.25 .50 .75 .90 .95
12 15 19 23 25
lowest : 0 1 2 3 4, highest: 58 62 64 74 99
--------------------------------------------------------------------------------
OCC6
n missing distinct Info Mean Gmd .05 .10
10000 0 14 0.603 0.4315 0.7004 0 0
.25 .50 .75 .90 .95
0 0 1 1 2
lowest : 0 1 2 3 4, highest: 9 10 12 15 20
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 7309 1837 508 177 73 36 21 18 11 2 3
Proportion 0.731 0.184 0.051 0.018 0.007 0.004 0.002 0.002 0.001 0.000 0.000
Value 12 15 20
Frequency 3 1 1
Proportion 0.000 0.000 0.000
--------------------------------------------------------------------------------
OCC7
n missing distinct Info Mean Gmd .05 .10
10000 0 22 0.945 1.666 1.782 0 0
.25 .50 .75 .90 .95
0 1 2 4 5
lowest : 0 1 2 3 4, highest: 18 21 24 31 38
--------------------------------------------------------------------------------
OCC8
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.996 10.52 6.419 3 4
.25 .50 .75 .90 .95
7 10 13 18 22
lowest : 0 1 2 3 4, highest: 51 52 54 59 99
--------------------------------------------------------------------------------
OCC9
n missing distinct Info Mean Gmd .05 .10
10000 0 55 0.941 2.781 3.877 0 0
.25 .50 .75 .90 .95
0 1 3 7 11
lowest : 0 1 2 3 4, highest: 60 65 69 70 77
--------------------------------------------------------------------------------
OCC10
n missing distinct Info Mean Gmd .05 .10
10000 0 42 0.997 11.33 6.492 2 4
.25 .50 .75 .90 .95
7 11 15 19 21
lowest : 0 1 2 3 4, highest: 37 38 39 41 46
--------------------------------------------------------------------------------
OCC11
n missing distinct Info Mean Gmd .05 .10
10000 0 42 0.994 6.293 6.029 0 0
.25 .50 .75 .90 .95
2 5 9 14 18
lowest : 0 1 2 3 4, highest: 40 41 42 60 99
--------------------------------------------------------------------------------
OCC12
n missing distinct Info Mean Gmd .05 .10
10000 0 29 0.988 3.947 3.347 0 0
.25 .50 .75 .90 .95
2 3 6 8 9
lowest : 0 1 2 3 4, highest: 24 25 26 27 34
--------------------------------------------------------------------------------
OCC13
n missing distinct Info Mean Gmd .05 .10
10000 0 28 0.985 3.674 3.063 0 0
.25 .50 .75 .90 .95
2 3 5 7 9
lowest : 0 1 2 3 4, highest: 24 29 31 32 38
--------------------------------------------------------------------------------
EIC1
n missing distinct Info Mean Gmd .05 .10
10000 0 60 0.955 3.083 4.208 0 0
.25 .50 .75 .90 .95
0 1 3 8 13
lowest : 0 1 2 3 4, highest: 66 72 73 81 83
--------------------------------------------------------------------------------
EIC2
n missing distinct Info Mean Gmd .05 .10
10000 0 34 0.525 0.6919 1.235 0 0
.25 .50 .75 .90 .95
0 0 0 2 4
lowest : 0 1 2 3 4, highest: 31 33 34 36 49
--------------------------------------------------------------------------------
EIC3
n missing distinct Info Mean Gmd .05 .10
10000 0 36 0.992 6.402 4.3 0 2
.25 .50 .75 .90 .95
4 6 9 11 13
lowest : 0 1 2 3 4, highest: 32 34 35 36 99
--------------------------------------------------------------------------------
EIC4
n missing distinct Info Mean Gmd .05 .10
10000 0 62 0.999 16.98 11.11 3 6
.25 .50 .75 .90 .95
9 15 23 31 36
lowest : 0 1 2 3 4, highest: 59 60 63 71 99
--------------------------------------------------------------------------------
EIC5
n missing distinct Info Mean Gmd .05 .10
10000 0 32 0.985 4.317 3.301 0 1
.25 .50 .75 .90 .95
2 4 6 8 10
lowest : 0 1 2 3 4, highest: 28 29 30 32 34
--------------------------------------------------------------------------------
EIC6
n missing distinct Info Mean Gmd .05 .10
10000 0 21 0.974 2.696 2.33 0 0
.25 .50 .75 .90 .95
1 2 4 5 7
lowest : 0 1 2 3 4, highest: 17 18 21 22 24
--------------------------------------------------------------------------------
EIC7
n missing distinct Info Mean Gmd .05 .10
10000 0 27 0.987 4.351 3.221 0 1
.25 .50 .75 .90 .95
2 4 6 8 10
lowest : 0 1 2 3 4, highest: 22 23 24 27 31
--------------------------------------------------------------------------------
EIC8
n missing distinct Info Mean Gmd .05 .10
10000 0 54 0.997 16.84 6.632 8 10
.25 .50 .75 .90 .95
13 16 20 24 27
lowest : 0 1 2 3 4, highest: 50 54 60 66 99
--------------------------------------------------------------------------------
EIC9
n missing distinct Info Mean Gmd .05 .10
10000 0 40 0.994 6.939 5.003 1 2
.25 .50 .75 .90 .95
4 6 9 13 15
lowest : 0 1 2 3 4, highest: 35 37 41 56 99
--------------------------------------------------------------------------------
EIC10
n missing distinct Info Mean Gmd .05 .10
10000 0 30 0.986 4.721 3.187 0 1
.25 .50 .75 .90 .95
3 4 6 8 10
lowest : 0 1 2 3 4, highest: 25 26 27 29 31
--------------------------------------------------------------------------------
EIC11
n missing distinct Info Mean Gmd .05 .10
10000 0 36 0.976 3.284 2.948 0 0
.25 .50 .75 .90 .95
2 3 4 7 9
lowest : 0 1 2 3 4, highest: 35 36 37 44 99
--------------------------------------------------------------------------------
EIC12
n missing distinct Info Mean Gmd .05 .10
10000 0 26 0.929 1.538 1.84 0 0
.25 .50 .75 .90 .95
0 1 2 4 5
lowest : 0 1 2 3 4, highest: 23 24 25 27 28
--------------------------------------------------------------------------------
EIC13
n missing distinct Info Mean Gmd .05 .10
10000 0 40 0.993 8.045 4.677 2 3
.25 .50 .75 .90 .95
5 7 10 13 16
lowest : 0 1 2 3 4, highest: 36 42 51 60 99
--------------------------------------------------------------------------------
EIC14
n missing distinct Info Mean Gmd .05 .10
10000 0 52 0.994 8.02 5.277 1 3
.25 .50 .75 .90 .95
5 7 10 14 17
lowest : 0 1 2 3 4, highest: 54 56 57 58 66
--------------------------------------------------------------------------------
EIC15
n missing distinct Info Mean Gmd .05 .10
10000 0 47 0.992 6.639 4.562 1 2
.25 .50 .75 .90 .95
4 6 8 12 15
lowest : 0 1 2 3 4, highest: 59 61 65 76 78
--------------------------------------------------------------------------------
EIC16
n missing distinct Info Mean Gmd .05 .10
10000 0 41 0.988 4.656 4.05 0 1
.25 .50 .75 .90 .95
2 4 6 9 12
lowest : 0 1 2 3 4, highest: 36 38 40 47 65
--------------------------------------------------------------------------------
OEDC1
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.992 7.03 4.438 1 2
.25 .50 .75 .90 .95
4 6 9 12 15
lowest : 0 1 2 3 4, highest: 33 34 43 60 65
--------------------------------------------------------------------------------
OEDC2
n missing distinct Info Mean Gmd .05 .10
10000 0 46 0.988 4.661 4.688 0 0
.25 .50 .75 .90 .95
2 3 6 10 14
lowest : 0 1 2 3 4, highest: 41 46 47 49 51
--------------------------------------------------------------------------------
OEDC3
n missing distinct Info Mean Gmd .05 .10
10000 0 49 0.977 3.241 3.556 0 0
.25 .50 .75 .90 .95
1 2 4 7 10
lowest : 0 1 2 3 4, highest: 46 49 51 67 99
--------------------------------------------------------------------------------
OEDC4
n missing distinct Info Mean Gmd .05 .10
10000 0 51 0.995 8.056 5.7 1 2
.25 .50 .75 .90 .95
5 7 10 15 18
lowest : 0 1 2 3 4, highest: 56 61 65 84 99
--------------------------------------------------------------------------------
OEDC5
n missing distinct Info Mean Gmd .05 .10
10000 0 79 0.999 69.59 12.37 50 56
.25 .50 .75 .90 .95
64 71 77 82 85
lowest : 0 13 19 23 24, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
OEDC6
n missing distinct Info Mean Gmd .05 .10
10000 0 47 0.991 6.142 4.364 1 2
.25 .50 .75 .90 .95
3 5 8 11 14
lowest : 0 1 2 3 4, highest: 48 49 52 81 99
--------------------------------------------------------------------------------
OEDC7
n missing distinct Info Mean Gmd .05 .10
10000 0 13 0.687 0.5007 0.7592 0 0
.25 .50 .75 .90 .95
0 0 1 2 2
lowest : 0 1 2 3 4, highest: 8 9 11 12 13
Value 0 1 2 3 4 5 6 7 8 9 11
Frequency 6708 2238 683 216 90 37 12 9 1 2 1
Proportion 0.671 0.224 0.068 0.022 0.009 0.004 0.001 0.001 0.000 0.000 0.000
Value 12 13
Frequency 1 2
Proportion 0.000 0.000
--------------------------------------------------------------------------------
EC1
n missing distinct Info Mean Gmd .05 .10
10000 0 76 0.882 128.4 15.56 120 120
.25 .50 .75 .90 .95
120 120 140 152 160
lowest : 0 89 90 91 92, highest: 159 160 161 162 170
--------------------------------------------------------------------------------
EC2
n missing distinct Info Mean Gmd .05 .10
10000 0 66 0.997 8.739 8.421 0 1
.25 .50 .75 .90 .95
3 6 12 19 24
lowest : 0 1 2 3 4, highest: 71 72 73 82 99
--------------------------------------------------------------------------------
EC3
n missing distinct Info Mean Gmd .05 .10
10000 0 49 0.998 12.72 8.396 2 4
.25 .50 .75 .90 .95
7 12 18 23 26
lowest : 0 1 2 3 4, highest: 45 47 53 55 67
--------------------------------------------------------------------------------
EC4
n missing distinct Info Mean Gmd .05 .10
10000 0 67 0.999 28.74 11.6 11 15
.25 .50 .75 .90 .95
22 29 36 42 45
lowest : 0 1 2 3 4, highest: 62 63 64 65 99
--------------------------------------------------------------------------------
EC5
n missing distinct Info Mean Gmd .05 .10
10000 0 52 0.998 20.75 7.817 10 12
.25 .50 .75 .90 .95
16 21 25 30 32
lowest : 0 1 2 3 4, highest: 47 48 53 54 56
--------------------------------------------------------------------------------
EC6
n missing distinct Info Mean Gmd .05 .10
10000 0 28 0.991 6.585 3.771 2 2
.25 .50 .75 .90 .95
4 6 9 11 13
lowest : 0 1 2 3 4, highest: 24 25 26 31 33
--------------------------------------------------------------------------------
EC7
n missing distinct Info Mean Gmd .05 .10
10000 0 56 0.999 14.28 10.85 2 4
.25 .50 .75 .90 .95
7 12 20 29 34
lowest : 0 1 2 3 4, highest: 51 52 53 57 65
--------------------------------------------------------------------------------
EC8
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.995 7.414 7.063 0 1
.25 .50 .75 .90 .95
3 5 10 17 22
lowest : 0 1 2 3 4, highest: 50 51 54 58 59
--------------------------------------------------------------------------------
SEC1
n missing distinct Info Mean Gmd .05 .10
10000 0 45 0.983 3.744 3.332 0.00 1.00
.25 .50 .75 .90 .95
1.00 3.00 5.00 7.00 9.05
lowest : 0 1 2 3 4, highest: 52 55 57 71 90
--------------------------------------------------------------------------------
SEC2
n missing distinct Info Mean Gmd .05 .10
10000 0 70 0.998 21.94 8.187 8 12
.25 .50 .75 .90 .95
18 22 26 30 33
lowest : 0 1 2 3 4, highest: 75 76 80 90 99
--------------------------------------------------------------------------------
SEC3
n missing distinct Info Mean Gmd .05 .10
10000 0 12 0.941 1.896 1.439 0 0
.25 .50 .75 .90 .95
1 2 3 4 4
lowest : 0 1 2 3 4, highest: 7 8 9 10 11
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 1323 2909 3008 1659 673 286 93 31 9 6 2
Proportion 0.132 0.291 0.301 0.166 0.067 0.029 0.009 0.003 0.001 0.001 0.000
Value 11
Frequency 1
Proportion 0.000
--------------------------------------------------------------------------------
# Pas dans les 300 premières
describe(data.train[,301:400])
data.train[, 301:400]
100 Variables 10000 Observations
--------------------------------------------------------------------------------
SEC4
n missing distinct Info Mean Gmd .05 .10
10000 0 45 0.997 17.23 7.026 5 9
.25 .50 .75 .90 .95
14 18 21 25 27
lowest : 0 1 2 3 4, highest: 40 42 44 48 51
--------------------------------------------------------------------------------
SEC5
n missing distinct Info Mean Gmd .05 .10
10000 0 74 0.991 6.835 4.732 2 2
.25 .50 .75 .90 .95
4 6 8 11 14
lowest : 0 1 2 3 4, highest: 81 86 92 97 99
--------------------------------------------------------------------------------
AFC1
n missing distinct Info Mean Gmd .05 .10
10000 0 39 0.447 0.5791 1.07 0 0
.25 .50 .75 .90 .95
0 0 0 1 2
lowest : 0 1 2 3 4, highest: 71 74 78 84 91
--------------------------------------------------------------------------------
AFC2
n missing distinct Info Mean Gmd .05 .10
10000 0 55 0.542 1.025 1.867 0 0
.25 .50 .75 .90 .95
0 0 0 2 4
lowest : 0 1 2 3 4, highest: 90 92 93 94 97
--------------------------------------------------------------------------------
AFC3
n missing distinct Info Mean Gmd .05 .10
10000 0 22 0.16 0.1475 0.2866 0 0
.25 .50 .75 .90 .95
0 0 0 0 1
lowest : 0 1 2 3 4, highest: 21 36 48 54 78
--------------------------------------------------------------------------------
AFC4
n missing distinct Info Mean Gmd .05 .10
10000 0 46 0.996 15.58 5.821 7 9
.25 .50 .75 .90 .95
12 15 19 22 24
lowest : 0 1 2 3 4, highest: 41 44 46 50 53
--------------------------------------------------------------------------------
AFC5
n missing distinct Info Mean Gmd .05 .10
10000 0 83 0.999 31.33 11.61 15 19
.25 .50 .75 .90 .95
25 31 37 44 49
lowest : 0 1 2 3 4, highest: 82 83 84 85 99
--------------------------------------------------------------------------------
AFC6
n missing distinct Info Mean Gmd .05 .10
10000 0 21 0.908 1.262 1.545 0 0
.25 .50 .75 .90 .95
0 1 2 3 4
lowest : 0 1 2 3 4, highest: 16 17 20 23 27
--------------------------------------------------------------------------------
VC1
n missing distinct Info Mean Gmd .05 .10
10000 0 91 0.999 30.62 16.2 7 13
.25 .50 .75 .90 .95
21 30 39 49 56
lowest : 0 1 2 3 4, highest: 88 90 91 93 99
--------------------------------------------------------------------------------
VC2
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.998 17.92 10.6 0 6
.25 .50 .75 .90 .95
12 17 23 30 35
lowest : 0 1 2 3 4, highest: 68 74 83 88 99
--------------------------------------------------------------------------------
VC3
n missing distinct Info Mean Gmd .05 .10
10000 0 98 1 33.21 18.87 7 12
.25 .50 .75 .90 .95
22 32 43 55 63
lowest : 0 1 2 3 4, highest: 94 95 96 97 99
--------------------------------------------------------------------------------
VC4
n missing distinct Info Mean Gmd .05 .10
10000 0 80 0.994 11.32 10.84 0 0
.25 .50 .75 .90 .95
4 9 16 24 31
lowest : 0 1 2 3 4, highest: 85 86 91 92 99
--------------------------------------------------------------------------------
ANC1
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.772 0.7341 1.091 0 0
.25 .50 .75 .90 .95
0 0 1 2 2
lowest : 0 1 2 3 4, highest: 47 48 50 54 61
--------------------------------------------------------------------------------
ANC2
n missing distinct Info Mean Gmd .05 .10
10000 0 37 0.993 5.457 4.426 0 1
.25 .50 .75 .90 .95
2 5 8 11 13
lowest : 0 1 2 3 4, highest: 33 41 43 49 50
--------------------------------------------------------------------------------
ANC3
n missing distinct Info Mean Gmd .05 .10
10000 0 21 0.847 0.8512 1.063 0 0
.25 .50 .75 .90 .95
0 1 1 2 3
lowest : 0 1 2 3 4, highest: 17 18 19 21 26
--------------------------------------------------------------------------------
ANC4
n missing distinct Info Mean Gmd .05 .10
10000 0 69 0.997 10.16 8.11 1 2
.25 .50 .75 .90 .95
5 8 13 19 26
lowest : 0 1 2 3 4, highest: 66 68 70 72 75
--------------------------------------------------------------------------------
ANC5
n missing distinct Info Mean Gmd .05 .10
10000 0 13 0.324 0.1802 0.328 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 8 9 10 12 13
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 8772 928 174 64 31 17 1 3 3 1 4
Proportion 0.877 0.093 0.017 0.006 0.003 0.002 0.000 0.000 0.000 0.000 0.000
Value 12 13
Frequency 1 1
Proportion 0.000 0.000
--------------------------------------------------------------------------------
ANC6
n missing distinct Info Mean Gmd .05 .10
10000 0 10 0.341 0.1693 0.3032 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 5 6 7 8 9
Value 0 1 2 3 4 5 6 7 8 9
Frequency 8696 1051 178 40 23 5 3 2 1 1
Proportion 0.870 0.105 0.018 0.004 0.002 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
ANC7
n missing distinct Info Mean Gmd .05 .10
10000 0 33 0.989 4.838 3.548 0 1
.25 .50 .75 .90 .95
3 4 7 9 11
lowest : 0 1 2 3 4, highest: 33 37 45 48 99
--------------------------------------------------------------------------------
ANC8
n missing distinct Info Mean Gmd .05 .10
10000 0 31 0.936 1.79 2.216 0 0
.25 .50 .75 .90 .95
0 1 2 4 6
lowest : 0 1 2 3 4, highest: 27 31 34 40 45
--------------------------------------------------------------------------------
ANC9
n missing distinct Info Mean Gmd .05 .10
10000 0 36 0.707 0.7888 1.284 0 0
.25 .50 .75 .90 .95
0 0 1 2 3
lowest : 0 1 2 3 4, highest: 35 44 47 49 54
--------------------------------------------------------------------------------
ANC10
n missing distinct Info Mean Gmd .05 .10
10000 0 39 0.858 1.304 1.914 0 0
.25 .50 .75 .90 .95
0 1 1 3 5
lowest : 0 1 2 3 4, highest: 37 39 44 50 65
--------------------------------------------------------------------------------
ANC11
n missing distinct Info Mean Gmd .05 .10
10000 0 19 0.239 0.1627 0.3078 0 0
.25 .50 .75 .90 .95
0 0 0 0 1
lowest : 0 1 2 3 4, highest: 17 19 20 25 31
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 9129 574 147 63 27 25 16 3 5 1 1
Proportion 0.913 0.057 0.015 0.006 0.003 0.002 0.002 0.000 0.000 0.000 0.000
Value 11 12 15 17 19 20 25 31
Frequency 2 1 1 1 1 1 1 1
Proportion 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
ANC12
n missing distinct Info Mean Gmd .05 .10
10000 0 21 0.436 0.3829 0.6918 0 0
.25 .50 .75 .90 .95
0 0 0 1 2
lowest : 0 1 2 3 4, highest: 16 17 19 30 52
--------------------------------------------------------------------------------
ANC13
n missing distinct Info Mean Gmd .05 .10
10000 0 14 0.828 0.726 0.9066 0 0
.25 .50 .75 .90 .95
0 1 1 2 2
lowest : 0 1 2 3 4, highest: 9 10 11 17 50
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 4998 3588 950 280 110 42 17 6 3 1 1
Proportion 0.500 0.359 0.095 0.028 0.011 0.004 0.002 0.001 0.000 0.000 0.000
Value 11 17 50
Frequency 1 2 1
Proportion 0.000 0.000 0.000
--------------------------------------------------------------------------------
ANC14
n missing distinct Info Mean Gmd .05 .10
10000 0 20 0.79 0.7226 1.025 0 0
.25 .50 .75 .90 .95
0 0 1 2 3
lowest : 0 1 2 3 4, highest: 16 18 21 23 25
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 5725 2798 847 309 149 73 36 22 13 4 2
Proportion 0.573 0.280 0.085 0.031 0.015 0.007 0.004 0.002 0.001 0.000 0.000
Value 11 12 14 15 16 18 21 23 25
Frequency 6 6 2 1 3 1 1 1 1
Proportion 0.001 0.001 0.000 0.000 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
ANC15
n missing distinct Info Mean Gmd
10000 0 8 0.164 0.0777 0.1485
lowest : 0 1 2 3 4, highest: 3 4 5 6 32
Value 0 1 2 3 4 5 6 32
Frequency 9421 475 70 13 12 5 3 1
Proportion 0.942 0.048 0.007 0.001 0.001 0.000 0.000 0.000
--------------------------------------------------------------------------------
POBC1
n missing distinct Info Mean Gmd .05 .10
10000 0 81 0.988 6.645 8.429 0 0
.25 .50 .75 .90 .95
1 3 8 17 26
lowest : 0 1 2 3 4, highest: 78 79 80 83 86
--------------------------------------------------------------------------------
POBC2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 57.07 24.5 17 27
.25 .50 .75 .90 .95
43 59 75 84 87
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LSC1
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.995 87.75 14.11 54 71
.25 .50 .75 .90 .95
86 94 97 98 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LSC2
n missing distinct Info Mean Gmd .05 .10
10000 0 99 0.975 6.077 8.759 0.00 0.00
.25 .50 .75 .90 .95
1.00 2.00 5.00 15.00 27.05
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LSC3
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.785 1.817 2.997 0 0
.25 .50 .75 .90 .95
0 0 2 5 9
lowest : 0 1 2 3 4, highest: 51 55 56 59 66
--------------------------------------------------------------------------------
LSC4
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.977 3.399 3.623 0 0
.25 .50 .75 .90 .95
1 2 4 7 10
lowest : 0 1 2 3 4, highest: 58 65 67 77 90
--------------------------------------------------------------------------------
VOC1
n missing distinct Info Mean Gmd .05 .10
10000 0 76 0.987 91.75 9.38 74 83
.25 .50 .75 .90 .95
90 95 98 99 99
lowest : 0 7 17 18 19, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
VOC2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 60.23 20.84 25 35
.25 .50 .75 .90 .95
49 62 74 83 87
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
VOC3
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.999 20.06 12.12 3.95 7.00
.25 .50 .75 .90 .95
12.00 19.00 27.00 34.00 39.00
lowest : 0 1 2 3 4, highest: 66 69 71 73 99
--------------------------------------------------------------------------------
HC1
n missing distinct Info Mean Gmd .05 .10
10000 0 32 0.99 6.83 4.276 2 3
.25 .50 .75 .90 .95
4 6 9 12 14
lowest : 0 1 2 3 4, highest: 27 28 29 30 31
--------------------------------------------------------------------------------
HC2
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.999 23.84 14.1 6 9
.25 .50 .75 .90 .95
15 21 32 43 51
lowest : 0 1 2 3 4, highest: 48 49 50 51 52
--------------------------------------------------------------------------------
HC3
n missing distinct Info Mean Gmd .05 .10
10000 0 62 0.853 2.549 3.934 0 0
.25 .50 .75 .90 .95
0 0 3 7 11
lowest : 0 1 2 3 4, highest: 76 81 82 98 99
--------------------------------------------------------------------------------
HC4
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.978 12.29 15.69 0 0
.25 .50 .75 .90 .95
0 6 17 32 47
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC5
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.993 22.66 25.13 0 0
.25 .50 .75 .90 .95
3 16 34 57 75
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC6
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 46.07 36.86 0 2
.25 .50 .75 .90 .95
16 47 73 93 98
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC7
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.998 62.71 35.67 3.0 10.9
.25 .50 .75 .90 .95
38.0 71.0 91.0 99.0 99.0
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC8
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 35.81 35.05 0 0
.25 .50 .75 .90 .95
8 28 59 87 95
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC9
n missing distinct Info Mean Gmd .05 .10
10000 0 69 0.603 2.563 4.505 0 0
.25 .50 .75 .90 .95
0 0 1 8 16
lowest : 0 1 2 3 4, highest: 73 74 75 79 89
--------------------------------------------------------------------------------
HC10
n missing distinct Info Mean Gmd .05 .10
10000 0 32 0.61 1.29 2.215 0 0
.25 .50 .75 .90 .95
0 0 1 4 8
lowest : 0 1 2 3 4, highest: 27 28 29 30 34
--------------------------------------------------------------------------------
HC11
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 52.29 40.43 0.0 0.0
.25 .50 .75 .90 .95
15.0 59.5 87.0 94.1 97.0
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC12
n missing distinct Info Mean Gmd .05 .10
10000 0 84 0.941 6.763 10.2 0 0
.25 .50 .75 .90 .95
0 1 7 24 35
lowest : 0 1 2 3 4, highest: 83 85 86 89 93
--------------------------------------------------------------------------------
HC13
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 29.46 30.01 0.00 2.00
.25 .50 .75 .90 .95
7.00 20.00 45.00 75.00 90.05
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC14
n missing distinct Info Mean Gmd .05 .10
10000 0 73 0.83 4.544 7.354 0 0
.25 .50 .75 .90 .95
0 0 4 16 26
lowest : 0 1 2 3 4, highest: 71 75 84 91 95
--------------------------------------------------------------------------------
HC15
n missing distinct Info Mean Gmd .05 .10
10000 0 13 0.116 0.0693 0.1349 0 0
.25 .50 .75 .90 .95
0 0 0 0 0
lowest : 0 1 2 3 4, highest: 8 9 10 12 30
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 9597 263 84 28 14 4 3 1 2 1 1
Proportion 0.960 0.026 0.008 0.003 0.001 0.000 0.000 0.000 0.000 0.000 0.000
Value 12 30
Frequency 1 1
Proportion 0.000 0.000
--------------------------------------------------------------------------------
HC16
n missing distinct Info Mean Gmd .05 .10
10000 0 83 0.91 5.367 8.324 0 0
.25 .50 .75 .90 .95
0 1 5 17 28
lowest : 0 1 2 3 4, highest: 79 83 85 89 99
--------------------------------------------------------------------------------
HC17
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.855 82.79 25.14 8 32
.25 .50 .75 .90 .95
80 99 99 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC18
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.88 14.97 23.23 0 0
.25 .50 .75 .90 .95
0 1 17 60 84
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC19
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.946 72.13 36.26 2 6
.25 .50 .75 .90 .95
45 95 99 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC20
n missing distinct Info Mean Gmd .05 .10
10000 0 32 0.451 97.54 2.78 96 97
.25 .50 .75 .90 .95
99 99 99 99 99
lowest : 0 27 38 50 60, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC21
n missing distinct Info Mean Gmd .05 .10
10000 0 50 0.926 94.37 6.863 83 88
.25 .50 .75 .90 .95
94 98 99 99 99
lowest : 0 9 32 33 44, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
MHUC1
n missing distinct Info Mean Gmd .05 .10
10000 0 22 0.987 8.036 3.676 4 5
.25 .50 .75 .90 .95
6 8 9 12 15
lowest : 0 1 2 3 4, highest: 17 18 19 20 21
--------------------------------------------------------------------------------
MHUC2
n missing distinct Info Mean Gmd
10000 0 6 0.784 2.315 0.8402
lowest : 0 1 2 3 4, highest: 1 2 3 4 5
Value 0 1 2 3 4 5
Frequency 388 392 5845 2522 760 93
Proportion 0.039 0.039 0.585 0.252 0.076 0.009
--------------------------------------------------------------------------------
AC1
n missing distinct Info Mean Gmd .05 .10
10000 0 28 0.985 5.811 3.09 1 2
.25 .50 .75 .90 .95
4 6 7 9 11
lowest : 0 1 2 3 4, highest: 24 26 27 28 38
--------------------------------------------------------------------------------
AC2
n missing distinct Info Mean Gmd .05 .10
10000 0 29 0.989 6.005 3.524 1 2
.25 .50 .75 .90 .95
4 6 8 10 12
lowest : 0 1 2 3 4, highest: 26 28 29 31 36
--------------------------------------------------------------------------------
ADATE_2
n missing distinct Info Mean Gmd
10000 0 2 0 9706 4e-04
Value 9704 9706
Frequency 1 9999
Proportion 0 1
--------------------------------------------------------------------------------
ADATE_3
n missing distinct Info Mean Gmd
9798 202 2 0.001 9606 0.001224
Value 9604 9606
Frequency 3 9795
Proportion 0 1
--------------------------------------------------------------------------------
ADATE_4
n missing distinct Info Mean Gmd
9777 223 7 0.029 9604 0.07266
lowest : 9511 9602 9603 9604 9605, highest: 9603 9604 9605 9607 9609
Value 9511 9602 9603 9604 9605 9607 9609
Frequency 1 2 3 9682 20 54 15
Proportion 0.000 0.000 0.000 0.990 0.002 0.006 0.002
--------------------------------------------------------------------------------
ADATE_5
n missing distinct Info Mean Gmd
6457 3543 1 0 9604 0
Value 9604
Frequency 6457
Proportion 1
--------------------------------------------------------------------------------
ADATE_6
n missing distinct Info Mean Gmd
9614 386 2 0.001 9603 0.001664
Value 9601 9603
Frequency 4 9610
Proportion 0 1
--------------------------------------------------------------------------------
ADATE_7
n missing distinct Info Mean Gmd
9088 912 3 0.165 9602 0.4426
Value 9512 9601 9602
Frequency 17 515 8556
Proportion 0.002 0.057 0.941
--------------------------------------------------------------------------------
ADATE_8
n missing distinct Info Mean Gmd
9627 373 4 0.19 9595 11.23
Value 9511 9512 9601 9603
Frequency 5 646 8973 3
Proportion 0.001 0.067 0.932 0.000
--------------------------------------------------------------------------------
ADATE_9
n missing distinct Info Mean Gmd
8818 1182 3 0.113 9511 0.1354
Value 9509 9510 9511
Frequency 272 75 8471
Proportion 0.031 0.009 0.961
--------------------------------------------------------------------------------
ADATE_10
n missing distinct Info Mean Gmd
6655 3345 2 0.215 9510 0.1433
Value 9510 9511
Frequency 6138 517
Proportion 0.922 0.078
--------------------------------------------------------------------------------
ADATE_11
n missing distinct Info Mean Gmd
8919 1081 4 0.651 9510 0.4792
Value 9508 9509 9510 9511
Frequency 6 2294 6173 446
Proportion 0.001 0.257 0.692 0.050
--------------------------------------------------------------------------------
ADATE_12
n missing distinct Info Mean Gmd
9071 929 4 0.742 9508 0.5096
Value 9507 9508 9509 9510
Frequency 5 5146 3839 81
Proportion 0.001 0.567 0.423 0.009
--------------------------------------------------------------------------------
ADATE_13
n missing distinct Info Mean Gmd
5744 4256 3 0.024 9507 0.05884
Value 9502 9506 9507
Frequency 31 15 5698
Proportion 0.005 0.003 0.992
--------------------------------------------------------------------------------
ADATE_14
n missing distinct Info Mean Gmd
8021 1979 2 0.005 9506 0.006473
Value 9504 9506
Frequency 13 8008
Proportion 0.002 0.998
--------------------------------------------------------------------------------
ADATE_15
n missing distinct Info Mean Gmd
3078 6922 1 0 9504 0
Value 9504
Frequency 3078
Proportion 1
--------------------------------------------------------------------------------
ADATE_16
n missing distinct Info Mean Gmd
7895 2105 3 0.062 9503 0.04179
Value 9502 9503 9504
Frequency 13 7727 155
Proportion 0.002 0.979 0.020
--------------------------------------------------------------------------------
ADATE_17
n missing distinct Info Mean Gmd
7083 2917 3 0.217 9502 0.1447
Value 9501 9502 9503
Frequency 555 6527 1
Proportion 0.078 0.922 0.000
--------------------------------------------------------------------------------
ADATE_18
n missing distinct Info Mean Gmd
7709 2291 6 0.724 9465 42.9
lowest : 9409 9411 9412 9501 9504, highest: 9411 9412 9501 9504 9505
Value 9409 9411 9412 9501 9504 9505
Frequency 3 8 3109 4586 1 2
Proportion 0.000 0.001 0.403 0.595 0.000 0.000
--------------------------------------------------------------------------------
ADATE_19
n missing distinct Info Mean Gmd
7462 2538 3 0.064 9411 0.08317
Value 9409 9410 9411
Frequency 154 9 7299
Proportion 0.021 0.001 0.978
--------------------------------------------------------------------------------
ADATE_20
n missing distinct Info Mean Gmd
4753 5247 2 0.001 9411 0.0004208
Value 9411 9412
Frequency 4752 1
Proportion 1 0
--------------------------------------------------------------------------------
ADATE_21
n missing distinct Info Mean Gmd
6342 3658 2 0.195 9410 0.13
Value 9409 9410
Frequency 443 5899
Proportion 0.07 0.93
--------------------------------------------------------------------------------
ADATE_22
n missing distinct Info Mean Gmd
7304 2696 4 0.224 9410 2.233
Value 9408 9409 9410 9506
Frequency 512 6710 2 80
Proportion 0.070 0.919 0.000 0.011
--------------------------------------------------------------------------------
ADATE_23
n missing distinct Info Mean Gmd
4123 5877 3 0.022 9407 0.06053
Value 9312 9406 9407
Frequency 1 30 4092
Proportion 0.000 0.007 0.992
--------------------------------------------------------------------------------
ADATE_24
n missing distinct Info Mean Gmd
6186 3814 2 0.015 9406 0.01029
Value 9405 9406
Frequency 32 6154
Proportion 0.005 0.995
--------------------------------------------------------------------------------
RFA_2
n missing distinct
10000 0 14
lowest : L1E L1F L1G L2E L2F, highest: L3G L4D L4E L4F L4G
Value L1E L1F L1G L2E L2F L2G L3D L3E L3F L3G L4D
Frequency 550 3237 1269 536 1174 476 234 823 363 149 504
Proportion 0.055 0.324 0.127 0.054 0.117 0.048 0.023 0.082 0.036 0.015 0.050
Value L4E L4F L4G
Frequency 388 206 91
Proportion 0.039 0.021 0.009
--------------------------------------------------------------------------------
RFA_3
n missing distinct
9798 202 60
lowest : A1D A1E A1F A1G A2D, highest: S3G S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_4
n missing distinct
9769 231 56
lowest : A1D A1E A1F A1G A2D, highest: S3G S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_5
n missing distinct
6457 3543 39
lowest : A1F A1G A2E A2F A2G, highest: S3G S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_6
n missing distinct
9614 386 92
lowest : A1A A1C A1D A1E A1F, highest: S4F S4G U1E U1F U1G
--------------------------------------------------------------------------------
RFA_7
n missing distinct
9088 912 89
lowest : A1C A1D A1E A1F A1G, highest: S4C S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_8
n missing distinct
9627 373 92
lowest : A1B A1C A1D A1E A1F, highest: S4F S4G U1E U1F U1G
--------------------------------------------------------------------------------
RFA_9
n missing distinct
8818 1182 99
lowest : A1A A1B A1C A1D A1E, highest: S4E S4F S4G U1F U1G
--------------------------------------------------------------------------------
RFA_10
n missing distinct
6655 3345 79
lowest : A1C A1D A1E A1F A1G, highest: S4C S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_11
n missing distinct
8919 1081 95
lowest : A1B A1C A1D A1E A1F, highest: S4C S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_12
n missing distinct
9071 929 99
lowest : A1B A1C A1D A1E A1F, highest: S4G U1D U1E U1F U1G
--------------------------------------------------------------------------------
RFA_13
n missing distinct
5744 4256 65
lowest : A1E A1F A1G A2D A2E, highest: S4C S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_14
n missing distinct
8021 1979 87
lowest : A1C A1D A1E A1F A1G, highest: S4D S4E S4F S4G U1D
--------------------------------------------------------------------------------
RFA_15
n missing distinct
3078 6922 31
lowest : A1G A2F A2G A3D A3E, highest: S3G S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_16
n missing distinct
7890 2110 106
lowest : A1A A1B A1C A1D A1E, highest: U1C U1D U1E U1F U1G
--------------------------------------------------------------------------------
RFA_17
n missing distinct
7083 2917 105
lowest : A1B A1C A1D A1E A1F, highest: S4F S4G U1D U1E U1F
--------------------------------------------------------------------------------
describe(data.val[,301:400])
data.val[, 301:400]
100 Variables 10000 Observations
--------------------------------------------------------------------------------
SEC4
n missing distinct Info Mean Gmd .05 .10
10000 0 45 0.997 17.06 7.005 5 9
.25 .50 .75 .90 .95
13 18 21 24 26
lowest : 0 1 2 3 4, highest: 40 41 42 44 48
--------------------------------------------------------------------------------
SEC5
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.991 6.731 4.518 2 2
.25 .50 .75 .90 .95
4 6 8 11 14
lowest : 0 1 2 3 4, highest: 75 85 86 89 99
--------------------------------------------------------------------------------
AFC1
n missing distinct Info Mean Gmd .05 .10
10000 0 45 0.439 0.5779 1.072 0 0
.25 .50 .75 .90 .95
0 0 0 1 2
lowest : 0 1 2 3 4, highest: 69 72 74 83 86
--------------------------------------------------------------------------------
AFC2
n missing distinct Info Mean Gmd .05 .10
10000 0 62 0.536 1.028 1.881 0 0
.25 .50 .75 .90 .95
0 0 0 2 4
lowest : 0 1 2 3 4, highest: 90 93 94 95 97
--------------------------------------------------------------------------------
AFC3
n missing distinct Info Mean Gmd .05 .10
10000 0 20 0.164 0.145 0.2812 0 0
.25 .50 .75 .90 .95
0 0 0 0 1
lowest : 0 1 2 3 4, highest: 18 20 29 53 60
Value 0 1 2 3 4 5 6 7 8 9 13
Frequency 9422 312 118 57 32 17 13 4 8 3 1
Proportion 0.942 0.031 0.012 0.006 0.003 0.002 0.001 0.000 0.001 0.000 0.000
Value 14 15 16 17 18 20 29 53 60
Frequency 1 2 2 2 2 1 1 1 1
Proportion 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
AFC4
n missing distinct Info Mean Gmd .05 .10
10000 0 46 0.996 15.59 5.74 7 9
.25 .50 .75 .90 .95
12 15 19 22 24
lowest : 0 1 2 3 4, highest: 45 46 50 55 63
--------------------------------------------------------------------------------
AFC5
n missing distinct Info Mean Gmd .05 .10
10000 0 79 0.999 31.36 11.54 15 19
.25 .50 .75 .90 .95
25 31 37 44 49
lowest : 0 1 2 3 4, highest: 79 82 83 84 85
--------------------------------------------------------------------------------
AFC6
n missing distinct Info Mean Gmd .05 .10
10000 0 21 0.909 1.244 1.504 0 0
.25 .50 .75 .90 .95
0 1 2 3 4
lowest : 0 1 2 3 4, highest: 16 20 25 35 50
--------------------------------------------------------------------------------
VC1
n missing distinct Info Mean Gmd .05 .10
10000 0 91 0.999 30.54 16.01 7 13
.25 .50 .75 .90 .95
21 30 39 49 55
lowest : 0 1 2 3 4, highest: 87 88 89 91 99
--------------------------------------------------------------------------------
VC2
n missing distinct Info Mean Gmd .05 .10
10000 0 72 0.998 17.88 10.5 0 6
.25 .50 .75 .90 .95
12 17 23 30 34
lowest : 0 1 2 3 4, highest: 76 80 81 88 99
--------------------------------------------------------------------------------
VC3
n missing distinct Info Mean Gmd .05 .10
10000 0 99 1 33.78 19.04 8 13
.25 .50 .75 .90 .95
22 32 44 56 64
lowest : 0 1 2 3 4, highest: 94 95 96 98 99
--------------------------------------------------------------------------------
VC4
n missing distinct Info Mean Gmd .05 .10
10000 0 79 0.994 11.16 10.69 0 0
.25 .50 .75 .90 .95
4 9 15 24 31
lowest : 0 1 2 3 4, highest: 85 86 89 93 99
--------------------------------------------------------------------------------
ANC1
n missing distinct Info Mean Gmd .05 .10
10000 0 38 0.768 0.731 1.093 0 0
.25 .50 .75 .90 .95
0 0 1 2 2
lowest : 0 1 2 3 4, highest: 44 45 48 50 70
--------------------------------------------------------------------------------
ANC2
n missing distinct Info Mean Gmd .05 .10
10000 0 41 0.993 5.519 4.547 0 1
.25 .50 .75 .90 .95
2 5 8 11 13
lowest : 0 1 2 3 4, highest: 39 40 43 45 47
--------------------------------------------------------------------------------
ANC3
n missing distinct Info Mean Gmd .05 .10
10000 0 20 0.85 0.8837 1.092 0 0
.25 .50 .75 .90 .95
0 1 1 2 3
lowest : 0 1 2 3 4, highest: 15 16 17 25 32
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 4471 3898 1034 301 88 61 42 22 24 14 13
Proportion 0.447 0.390 0.103 0.030 0.009 0.006 0.004 0.002 0.002 0.001 0.001
Value 11 12 13 14 15 16 17 25 32
Frequency 6 9 3 3 2 4 3 1 1
Proportion 0.001 0.001 0.000 0.000 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
ANC4
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.997 10.05 8.02 1 2
.25 .50 .75 .90 .95
5 8 13 19 26
lowest : 0 1 2 3 4, highest: 65 67 70 73 99
--------------------------------------------------------------------------------
ANC5
n missing distinct Info Mean Gmd .05 .10
10000 0 13 0.337 0.2016 0.3669 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 8 9 10 12 32
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 8715 920 210 68 44 16 14 4 4 2 1
Proportion 0.872 0.092 0.021 0.007 0.004 0.002 0.001 0.000 0.000 0.000 0.000
Value 12 32
Frequency 1 1
Proportion 0.000 0.000
--------------------------------------------------------------------------------
ANC6
n missing distinct Info Mean Gmd .05 .10
10000 0 11 0.359 0.1767 0.3136 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 6 7 8 9 11
Value 0 1 2 3 4 5 6 7 8 9 11
Frequency 8614 1133 181 47 12 6 3 1 1 1 1
Proportion 0.861 0.113 0.018 0.005 0.001 0.001 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
ANC7
n missing distinct Info Mean Gmd .05 .10
10000 0 32 0.989 4.854 3.539 0 1
.25 .50 .75 .90 .95
3 4 7 9 11
lowest : 0 1 2 3 4, highest: 28 31 33 42 44
--------------------------------------------------------------------------------
ANC8
n missing distinct Info Mean Gmd .05 .10
10000 0 27 0.937 1.815 2.241 0 0
.25 .50 .75 .90 .95
0 1 2 5 6
lowest : 0 1 2 3 4, highest: 23 25 27 31 41
--------------------------------------------------------------------------------
ANC9
n missing distinct Info Mean Gmd .05 .10
10000 0 31 0.704 0.7559 1.223 0 0
.25 .50 .75 .90 .95
0 0 1 2 3
lowest : 0 1 2 3 4, highest: 28 31 37 44 50
--------------------------------------------------------------------------------
ANC10
n missing distinct Info Mean Gmd .05 .10
10000 0 40 0.858 1.319 1.945 0 0
.25 .50 .75 .90 .95
0 1 1 3 5
lowest : 0 1 2 3 4, highest: 43 49 55 56 57
--------------------------------------------------------------------------------
ANC11
n missing distinct Info Mean Gmd .05 .10
10000 0 18 0.241 0.1503 0.2835 0 0
.25 .50 .75 .90 .95
0 0 0 0 1
lowest : 0 1 2 3 4, highest: 14 15 17 25 28
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 9120 639 115 54 25 16 13 5 3 1 1
Proportion 0.912 0.064 0.012 0.005 0.002 0.002 0.001 0.000 0.000 0.000 0.000
Value 12 13 14 15 17 25 28
Frequency 2 1 1 1 1 1 1
Proportion 0.000 0.000 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
ANC12
n missing distinct Info Mean Gmd .05 .10
10000 0 23 0.457 0.4283 0.7708 0 0
.25 .50 .75 .90 .95
0 0 0 1 2
lowest : 0 1 2 3 4, highest: 18 19 20 22 23
--------------------------------------------------------------------------------
ANC13
n missing distinct Info Mean Gmd .05 .10
10000 0 12 0.835 0.735 0.8933 0 0
.25 .50 .75 .90 .95
0 1 1 2 2
lowest : 0 1 2 3 4, highest: 7 8 9 10 18
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 4856 3679 1015 299 80 43 13 6 4 1 3
Proportion 0.486 0.368 0.102 0.030 0.008 0.004 0.001 0.001 0.000 0.000 0.000
Value 18
Frequency 1
Proportion 0.000
--------------------------------------------------------------------------------
ANC14
n missing distinct Info Mean Gmd .05 .10
10000 0 21 0.794 0.7356 1.042 0 0
.25 .50 .75 .90 .95
0 0 1 2 3
lowest : 0 1 2 3 4, highest: 16 19 20 21 26
--------------------------------------------------------------------------------
ANC15
n missing distinct Info Mean Gmd
10000 0 8 0.171 0.0759 0.1443
lowest : 0 1 2 3 4, highest: 3 4 5 6 14
Value 0 1 2 3 4 5 6 14
Frequency 9393 512 71 16 3 1 2 2
Proportion 0.939 0.051 0.007 0.002 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
POBC1
n missing distinct Info Mean Gmd .05 .10
10000 0 85 0.988 6.847 8.709 0 0
.25 .50 .75 .90 .95
1 3 8 18 27
lowest : 0 1 2 3 4, highest: 82 83 84 85 86
--------------------------------------------------------------------------------
POBC2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 56.85 24.64 17 26
.25 .50 .75 .90 .95
42 59 75 84 87
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LSC1
n missing distinct Info Mean Gmd .05 .10
10000 0 99 0.995 87.78 14.04 53 71
.25 .50 .75 .90 .95
86 94 97 98 99
lowest : 0 2 3 4 5, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
LSC2
n missing distinct Info Mean Gmd .05 .10
10000 0 98 0.973 5.924 8.57 0 0
.25 .50 .75 .90 .95
1 2 5 14 28
lowest : 0 1 2 3 4, highest: 94 95 96 98 99
--------------------------------------------------------------------------------
LSC3
n missing distinct Info Mean Gmd .05 .10
10000 0 56 0.782 1.966 3.265 0 0
.25 .50 .75 .90 .95
0 0 2 5 10
lowest : 0 1 2 3 4, highest: 57 58 59 75 78
--------------------------------------------------------------------------------
LSC4
n missing distinct Info Mean Gmd .05 .10
10000 0 52 0.978 3.533 3.819 0 0
.25 .50 .75 .90 .95
1 2 4 8 11
lowest : 0 1 2 3 4, highest: 60 62 66 77 99
--------------------------------------------------------------------------------
VOC1
n missing distinct Info Mean Gmd .05 .10
10000 0 81 0.986 91.81 9.413 73 82
.25 .50 .75 .90 .95
91 96 98 99 99
lowest : 0 7 14 15 16, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
VOC2
n missing distinct Info Mean Gmd .05 .10
10000 0 100 1 60.49 20.9 25 35
.25 .50 .75 .90 .95
50 63 74 83 87
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
VOC3
n missing distinct Info Mean Gmd .05 .10
10000 0 68 0.999 20.05 12.26 3 6
.25 .50 .75 .90 .95
12 19 27 35 39
lowest : 0 1 2 3 4, highest: 65 66 71 79 99
--------------------------------------------------------------------------------
HC1
n missing distinct Info Mean Gmd .05 .10
10000 0 32 0.99 6.949 4.361 2 3
.25 .50 .75 .90 .95
4 6 9 12 14
lowest : 0 1 2 3 4, highest: 27 28 29 30 31
--------------------------------------------------------------------------------
HC2
n missing distinct Info Mean Gmd .05 .10
10000 0 53 0.999 23.95 14.07 6 9
.25 .50 .75 .90 .95
15 22 33 43 51
lowest : 0 1 2 3 4, highest: 48 49 50 51 52
--------------------------------------------------------------------------------
HC3
n missing distinct Info Mean Gmd .05 .10
10000 0 61 0.858 2.583 3.967 0 0
.25 .50 .75 .90 .95
0 0 3 7 11
lowest : 0 1 2 3 4, highest: 79 82 91 98 99
--------------------------------------------------------------------------------
HC4
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.977 12.4 15.87 0 0
.25 .50 .75 .90 .95
0 6 17 33 48
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC5
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.993 22.59 25.09 0 0
.25 .50 .75 .90 .95
3 16 33 58 76
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC6
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 46.1 37.05 0 2
.25 .50 .75 .90 .95
16 46 73 93 98
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC7
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.998 62.43 35.86 2 10
.25 .50 .75 .90 .95
38 70 91 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC8
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 36.22 35.32 0 0
.25 .50 .75 .90 .95
8 29 59 88 96
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC9
n missing distinct Info Mean Gmd .05 .10
10000 0 73 0.62 2.918 5.112 0 0
.25 .50 .75 .90 .95
0 0 1 9 18
lowest : 0 1 2 3 4, highest: 74 76 79 81 83
--------------------------------------------------------------------------------
HC10
n missing distinct Info Mean Gmd .05 .10
10000 0 41 0.618 1.335 2.291 0 0
.25 .50 .75 .90 .95
0 0 1 4 8
lowest : 0 1 2 3 4, highest: 36 37 38 44 49
--------------------------------------------------------------------------------
HC11
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 52.17 40.61 0 1
.25 .50 .75 .90 .95
14 59 86 95 97
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC12
n missing distinct Info Mean Gmd .05 .10
10000 0 79 0.945 6.853 10.29 0 0
.25 .50 .75 .90 .95
0 1 7 24 35
lowest : 0 1 2 3 4, highest: 75 86 87 90 99
--------------------------------------------------------------------------------
HC13
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.999 29.8 30.42 0 2
.25 .50 .75 .90 .95
7 20 46 77 91
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC14
n missing distinct Info Mean Gmd .05 .10
10000 0 75 0.841 4.425 7.126 0 0
.25 .50 .75 .90 .95
0 0 4 15 25
lowest : 0 1 2 3 4, highest: 84 89 93 94 97
--------------------------------------------------------------------------------
HC15
n missing distinct Info Mean Gmd .05 .10
10000 0 13 0.114 0.0727 0.1418 0 0
.25 .50 .75 .90 .95
0 0 0 0 0
lowest : 0 1 2 3 4, highest: 8 9 10 14 30
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 9603 258 73 38 12 5 2 2 1 2 1
Proportion 0.960 0.026 0.007 0.004 0.001 0.000 0.000 0.000 0.000 0.000 0.000
Value 14 30
Frequency 1 2
Proportion 0.000 0.000
--------------------------------------------------------------------------------
HC16
n missing distinct Info Mean Gmd .05 .10
10000 0 80 0.905 5.238 8.123 0 0
.25 .50 .75 .90 .95
0 1 5 16 27
lowest : 0 1 2 3 4, highest: 76 77 79 81 93
--------------------------------------------------------------------------------
HC17
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.847 82.83 25.11 7.95 34.00
.25 .50 .75 .90 .95
79.75 99.00 99.00 99.00 99.00
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC18
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.875 15.09 23.46 0 0
.25 .50 .75 .90 .95
0 1 17 61 85
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC19
n missing distinct Info Mean Gmd .05 .10
10000 0 100 0.946 72.24 36.19 2 6
.25 .50 .75 .90 .95
45 95 99 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC20
n missing distinct Info Mean Gmd .05 .10
10000 0 34 0.447 97.7 2.477 95 97
.25 .50 .75 .90 .95
99 99 99 99 99
lowest : 0 46 60 62 64, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HC21
n missing distinct Info Mean Gmd .05 .10
10000 0 50 0.92 94.58 6.54 83 88
.25 .50 .75 .90 .95
94 98 99 99 99
lowest : 0 32 36 48 52, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
MHUC1
n missing distinct Info Mean Gmd .05 .10
10000 0 22 0.986 8.158 3.694 4 5
.25 .50 .75 .90 .95
6 8 9 12 15
lowest : 0 1 2 3 4, highest: 17 18 19 20 21
--------------------------------------------------------------------------------
MHUC2
n missing distinct Info Mean Gmd
10000 0 6 0.789 2.339 0.853
lowest : 0 1 2 3 4, highest: 1 2 3 4 5
Value 0 1 2 3 4 5
Frequency 374 372 5786 2528 842 98
Proportion 0.037 0.037 0.579 0.253 0.084 0.010
--------------------------------------------------------------------------------
AC1
n missing distinct Info Mean Gmd .05 .10
10000 0 25 0.986 5.836 3.071 1.95 2.00
.25 .50 .75 .90 .95
4.00 6.00 7.00 9.00 11.00
lowest : 0 1 2 3 4, highest: 20 21 22 23 24
--------------------------------------------------------------------------------
AC2
n missing distinct Info Mean Gmd .05 .10
10000 0 31 0.989 5.999 3.471 1 2
.25 .50 .75 .90 .95
4 6 8 10 12
lowest : 0 1 2 3 4, highest: 26 27 28 35 36
--------------------------------------------------------------------------------
ADATE_2
n missing distinct Info Mean Gmd
10000 0 1 0 9706 0
Value 9706
Frequency 10000
Proportion 1
--------------------------------------------------------------------------------
ADATE_3
n missing distinct Info Mean Gmd
9793 207 2 0 9606 0.0004085
Value 9604 9606
Frequency 1 9792
Proportion 0 1
--------------------------------------------------------------------------------
ADATE_4
n missing distinct Info Mean Gmd
9780 220 6 0.027 9604 0.05021
lowest : 9602 9603 9604 9605 9607, highest: 9603 9604 9605 9607 9609
Value 9602 9603 9604 9605 9607 9609
Frequency 3 3 9690 26 39 19
Proportion 0.000 0.000 0.991 0.003 0.004 0.002
--------------------------------------------------------------------------------
ADATE_5
n missing distinct Info Mean Gmd
6547 3453 1 0 9604 0
Value 9604
Frequency 6547
Proportion 1
--------------------------------------------------------------------------------
ADATE_6
n missing distinct Info Mean Gmd
9635 365 2 0.002 9603 0.002904
Value 9601 9603
Frequency 7 9628
Proportion 0.001 0.999
--------------------------------------------------------------------------------
ADATE_7
n missing distinct Info Mean Gmd
9096 904 3 0.152 9602 0.2969
Value 9512 9601 9602
Frequency 10 477 8609
Proportion 0.001 0.052 0.946
--------------------------------------------------------------------------------
ADATE_8
n missing distinct Info Mean Gmd
9669 331 4 0.203 9595 11.97
Value 9511 9512 9601 9603
Frequency 6 695 8965 3
Proportion 0.001 0.072 0.927 0.000
--------------------------------------------------------------------------------
ADATE_9
n missing distinct Info Mean Gmd
8828 1172 3 0.119 9511 0.1426
Value 9509 9510 9511
Frequency 288 78 8462
Proportion 0.033 0.009 0.959
--------------------------------------------------------------------------------
ADATE_10
n missing distinct Info Mean Gmd
6659 3341 2 0.228 9510 0.1521
Value 9510 9511
Frequency 6107 552
Proportion 0.917 0.083
--------------------------------------------------------------------------------
ADATE_11
n missing distinct Info Mean Gmd
8973 1027 4 0.656 9510 0.4812
Value 9508 9509 9510 9511
Frequency 8 2364 6175 426
Proportion 0.001 0.263 0.688 0.047
--------------------------------------------------------------------------------
ADATE_12
n missing distinct Info Mean Gmd
9127 873 4 0.743 9508 0.5122
Value 9507 9508 9509 9510
Frequency 8 5166 3864 89
Proportion 0.001 0.566 0.423 0.010
--------------------------------------------------------------------------------
ADATE_13
n missing distinct Info Mean Gmd
5833 4167 3 0.033 9507 0.06536
Value 9502 9506 9507
Frequency 32 32 5769
Proportion 0.005 0.005 0.989
--------------------------------------------------------------------------------
ADATE_14
n missing distinct Info Mean Gmd
8068 1932 2 0.009 9506 0.01186
Value 9504 9506
Frequency 24 8044
Proportion 0.003 0.997
--------------------------------------------------------------------------------
ADATE_15
n missing distinct Info Mean Gmd
3203 6797 1 0 9504 0
Value 9504
Frequency 3203
Proportion 1
--------------------------------------------------------------------------------
ADATE_16
n missing distinct Info Mean Gmd
7895 2105 3 0.06 9503 0.04034
Value 9502 9503 9504
Frequency 15 7733 147
Proportion 0.002 0.979 0.019
--------------------------------------------------------------------------------
ADATE_17
n missing distinct Info Mean Gmd
7104 2896 3 0.224 9502 0.1494
Value 9501 9502 9503
Frequency 575 6527 2
Proportion 0.081 0.919 0.000
--------------------------------------------------------------------------------
ADATE_18
n missing distinct Info Mean Gmd
7784 2216 8 0.73 9464 43.19
lowest : 9409 9411 9412 9501 9502, highest: 9501 9502 9503 9504 9508
Value 9409 9411 9412 9501 9502 9503 9504 9508
Frequency 3 17 3198 4556 2 3 2 3
Proportion 0.000 0.002 0.411 0.585 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
ADATE_19
n missing distinct Info Mean Gmd
7416 2584 3 0.076 9411 0.09807
Value 9409 9410 9411
Frequency 180 13 7223
Proportion 0.024 0.002 0.974
--------------------------------------------------------------------------------
ADATE_20
n missing distinct Info Mean Gmd
4711 5289 1 0 9411 0
Value 9411
Frequency 4711
Proportion 1
--------------------------------------------------------------------------------
ADATE_21
n missing distinct Info Mean Gmd
6327 3673 2 0.217 9410 0.1445
Value 9409 9410
Frequency 496 5831
Proportion 0.078 0.922
--------------------------------------------------------------------------------
ADATE_22
n missing distinct Info Mean Gmd
7305 2695 5 0.246 9410 2.838
lowest : 9408 9409 9410 9411 9506, highest: 9408 9409 9410 9411 9506
Value 9408 9409 9410 9411 9506
Frequency 553 6647 1 1 103
Proportion 0.076 0.910 0.000 0.000 0.014
--------------------------------------------------------------------------------
ADATE_23
n missing distinct Info Mean Gmd
4126 5874 3 0.025 9407 0.199
Value 9312 9406 9407
Frequency 4 31 4091
Proportion 0.001 0.008 0.992
--------------------------------------------------------------------------------
ADATE_24
n missing distinct Info Mean Gmd
6061 3939 2 0.015 9406 0.01018
Value 9405 9406
Frequency 31 6030
Proportion 0.005 0.995
--------------------------------------------------------------------------------
RFA_2
n missing distinct
10000 0 14
lowest : L1E L1F L1G L2E L2F, highest: L3G L4D L4E L4F L4G
Value L1E L1F L1G L2E L2F L2G L3D L3E L3F L3G L4D
Frequency 515 3152 1296 509 1208 517 241 785 368 164 488
Proportion 0.052 0.315 0.130 0.051 0.121 0.052 0.024 0.078 0.037 0.016 0.049
Value L4E L4F L4G
Frequency 432 216 109
Proportion 0.043 0.022 0.011
--------------------------------------------------------------------------------
RFA_3
n missing distinct
9793 207 60
lowest : A1D A1E A1F A1G A2D, highest: S3G S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_4
n missing distinct
9763 237 56
lowest : A1D A1E A1F A1G A2D, highest: S3G S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_5
n missing distinct
6547 3453 40
lowest : A1F A1G A2E A2F A2G, highest: S3G S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_6
n missing distinct
9635 365 91
lowest : A1A A1B A1C A1D A1E, highest: S4F S4G U1D U1E U1G
--------------------------------------------------------------------------------
RFA_7
n missing distinct
9096 904 87
lowest : A1B A1C A1D A1E A1F, highest: S4C S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_8
n missing distinct
9669 331 90
lowest : A1B A1C A1D A1E A1F, highest: S4E S4F S4G U1D U1E
--------------------------------------------------------------------------------
RFA_9
n missing distinct
8828 1172 96
lowest : A1A A1B A1C A1D A1E, highest: S4D S4E S4F S4G U1G
--------------------------------------------------------------------------------
RFA_10
n missing distinct
6659 3341 76
lowest : A1C A1D A1E A1F A1G, highest: S4C S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_11
n missing distinct
8973 1027 94
lowest : A1B A1C A1D A1E A1F, highest: S4C S4D S4E S4F S4G
--------------------------------------------------------------------------------
RFA_12
n missing distinct
9127 873 98
lowest : A1B A1C A1D A1E A1F, highest: S4G U1D U1E U1F U1G
--------------------------------------------------------------------------------
RFA_13
n missing distinct
5833 4167 63
lowest : A1E A1F A1G A2D A2E, highest: S4D S4E S4F S4G U1D
--------------------------------------------------------------------------------
RFA_14
n missing distinct
8068 1932 89
lowest : A1C A1D A1E A1F A1G, highest: S4F S4G U1C U1D U1G
--------------------------------------------------------------------------------
RFA_15
n missing distinct
3203 6797 33
lowest : A1G A2F A2G A3D A3E, highest: S4D S4E S4F S4G U1D
--------------------------------------------------------------------------------
RFA_16
n missing distinct
7889 2111 109
lowest : A1A A1B A1C A1D A1E, highest: U1C U1D U1E U1F U1G
--------------------------------------------------------------------------------
RFA_17
n missing distinct
7104 2896 109
lowest : A1C A1D A1E A1F A1G, highest: S4F S4G U1E U1F U1G
--------------------------------------------------------------------------------
# Pas dans les 400 premières
# Elles se situent donc faire la fin du jeu d'entrainement essayons donc de partir des dernières variables
describe(data.train[,470:481])
data.train[, 470:481]
12 Variables 10000 Observations
--------------------------------------------------------------------------------
CONTROLN
n missing distinct Info Mean Gmd .05 .10
10000 0 10000 1 95917 63825 10222 19677
.25 .50 .75 .90 .95
48131 95716 144420 172345 182048
lowest : 27 65 94 103 107, highest: 191649 191673 191716 191754 191758
--------------------------------------------------------------------------------
TARGET_B
n missing distinct Info Sum Mean Gmd
10000 0 2 0.145 509 0.0509 0.09663
--------------------------------------------------------------------------------
TARGET_D
n missing distinct Info Mean Gmd .05 .10
10000 0 44 0.145 0.7419 1.432 0 0
.25 .50 .75 .90 .95
0 0 0 0 3
lowest : 0.0 1.0 2.0 2.5 3.0, highest: 45.0 50.0 51.0 60.0 100.0
--------------------------------------------------------------------------------
HPHONE_D
n missing distinct Info Sum Mean Gmd
10000 0 2 0.75 5118 0.5118 0.4998
--------------------------------------------------------------------------------
RFA_2R
n missing distinct value
10000 0 1 L
Value L
Frequency 10000
Proportion 1
--------------------------------------------------------------------------------
RFA_2F
n missing distinct Info Mean Gmd
10000 0 4 0.855 1.889 1.109
Value 1 2 3 4
Frequency 5056 2186 1569 1189
Proportion 0.506 0.219 0.157 0.119
--------------------------------------------------------------------------------
RFA_2A
n missing distinct
10000 0 4
Value D E F G
Frequency 738 2297 4980 1985
Proportion 0.074 0.230 0.498 0.198
--------------------------------------------------------------------------------
MDMAUD_R
n missing distinct
10000 0 5
lowest : C D I L X, highest: C D I L X
Value C D I L X
Frequency 7 4 5 4 9980
Proportion 0.001 0.000 0.000 0.000 0.998
--------------------------------------------------------------------------------
MDMAUD_F
n missing distinct
10000 0 3
Value 1 2 X
Frequency 12 8 9980
Proportion 0.001 0.001 0.998
--------------------------------------------------------------------------------
MDMAUD_A
n missing distinct
10000 0 4
Value C L M X
Frequency 17 1 2 9980
Proportion 0.002 0.000 0.000 0.998
--------------------------------------------------------------------------------
CLUSTER2
n missing distinct Info Mean Gmd .05 .10
9989 11 62 1 31.59 21.53 4.0 5.0
.25 .50 .75 .90 .95
15.0 32.0 49.0 57.2 60.0
lowest : 1 2 3 4 5, highest: 58 59 60 61 62
--------------------------------------------------------------------------------
GEOCODE2
n missing distinct
9964 36 4
Value A B C D
Frequency 3602 3055 1614 1693
Proportion 0.362 0.307 0.162 0.170
--------------------------------------------------------------------------------
describe(data.val[,470:479])
data.val[, 470:479]
10 Variables 10000 Observations
--------------------------------------------------------------------------------
CONTROLN
n missing distinct Info Mean Gmd .05 .10
10000 0 10000 1 95236 63996 9161 18334
.25 .50 .75 .90 .95
46671 95653 142997 172030 182307
lowest : 28 89 179 194 203, highest: 191720 191767 191773 191774 191776
--------------------------------------------------------------------------------
HPHONE_D
n missing distinct Info Sum Mean Gmd
10000 0 2 0.75 5064 0.5064 0.5
--------------------------------------------------------------------------------
RFA_2R
n missing distinct value
10000 0 1 L
Value L
Frequency 10000
Proportion 1
--------------------------------------------------------------------------------
RFA_2F
n missing distinct Info Mean Gmd
10000 0 4 0.861 1.909 1.122
Value 1 2 3 4
Frequency 4963 2234 1558 1245
Proportion 0.496 0.223 0.156 0.124
--------------------------------------------------------------------------------
RFA_2A
n missing distinct
10000 0 4
Value D E F G
Frequency 729 2241 4944 2086
Proportion 0.073 0.224 0.494 0.209
--------------------------------------------------------------------------------
MDMAUD_R
n missing distinct
10000 0 5
lowest : C D I L X, highest: C D I L X
Value C D I L X
Frequency 9 6 6 7 9972
Proportion 0.001 0.001 0.001 0.001 0.997
--------------------------------------------------------------------------------
MDMAUD_F
n missing distinct
10000 0 4
Value 1 2 5 X
Frequency 20 7 1 9972
Proportion 0.002 0.001 0.000 0.997
--------------------------------------------------------------------------------
MDMAUD_A
n missing distinct
10000 0 4
Value C L M X
Frequency 23 2 3 9972
Proportion 0.002 0.000 0.000 0.997
--------------------------------------------------------------------------------
CLUSTER2
n missing distinct Info Mean Gmd .05 .10
9985 15 62 1 31.17 21.66 4 5
.25 .50 .75 .90 .95
14 31 49 57 60
lowest : 1 2 3 4 5, highest: 58 59 60 61 62
--------------------------------------------------------------------------------
GEOCODE2
n missing distinct
9971 29 4
Value A B C D
Frequency 3662 2941 1649 1719
Proportion 0.367 0.295 0.165 0.172
--------------------------------------------------------------------------------
# On remarque donc que les deux classes sont TargetB et TargetD
# TargetB a deux valeurs certainement que cela doit être un binaire pour savoir si une personne répond ou non
# Target D doit représenter la somme donnée si celui-ci à donné
unique(data.train$TARGET_B)
# 0 doit être le nombre de gens qui n'ont pas donné et 1 les gens qui ont donné
proportionB = prop.table(table(data.train$TARGET_B))
proportionB
0 1
0.9491 0.0509
On remarque que la majorité des personnes (Quasi-95%) font partie de la classe 0.
Regardons maintenant dans le nombre de personnes qui ont donné (c'est-à-dire que TARGET_D est supérieur à 0) les statistiques concernant leurs dons.
personneQuiOntDonne = data.train[data.train$TARGET_D >0,]
# On regarde le nombre de dons au cas ou TARGETB aurait été mal rempli
donsDesPersonnes = personneQuiOntDonne$TARGET_D
# Regardons maintenant combien de personnes ont donné cela correspond bien à la proportion des gens qui ont donné
dim(personneQuiOntDonne)
summary(donsDesPersonnes)
Min. 1st Qu. Median Mean 3rd Qu. Max. 1.00 10.00 12.00 14.58 20.00 100.00
On remarque que les dons sont compris entre 1 et 100 mais 75% font moins de 20€ et les restes sont des dons entre 20 et 100.
Prenons maintenant une représentation graphique pour mieux voir ces chiffres.
boxplot(donsDesPersonnes)
Sur le graphique ci-dessus on voit justement qu'une très grande partie des dons font moins de 55€s seuls deux dons sont aux dessus un don à 60 et celui à 100.
Essayons un graphique plus précis pour savoir quelle est la somme la plus donnée.
barplot(table(round(donsDesPersonnes)))
On arrondit car le don fait par la personne n'est pas forcément entier (11€50 par exemple). On remarque dans ce graphique que la somme la plus donnée et 10€ ce graphique semblé aussi en accord avec celui sur 75% des dons font moins de 20% et très peut font plus de 50 €.
# Avant de commencer à travailler avec les données nous voyons que beaucoup de données sont manquantes
# Nous allons donc les remplacer par leurs valeurs médianes
for(i in 1:ncol(data.train)){
data.train[is.na(data.train[,i]), i] <- median(data.train[,i], na.rm = TRUE)
}
variablesNumerique = which(sapply(data.train,is.numeric))
myHist <- function(x){
hist(data.train[,x],main=NULL,xlab=x)
}
sapply(names(variablesNumerique),myHist)
Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA" Warning message in mean.default(sort(x, partial = half + 0L:1L)[half + 0L:1L]): "l'argument n'est ni numérique, ni logique : renvoi de NA"
| ODATEDW | TCODE | DOB | CLUSTER | AGE | NUMCHLD | INCOME | WEALTH1 | HIT | MBCRAFT | ... | FISTDATE | NEXTDATE | TIMELAG | AVGGIFT | CONTROLN | TARGET_B | TARGET_D | HPHONE_D | RFA_2F | CLUSTER2 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| <int> | <int> | <dbl> | <dbl> | <chr> | <lgl> | <dbl> | <int> | <dbl> | <dbl> | ... | <dbl> | <dbl> | <chr> | <lgl> | <dbl> | <int> | <dbl> | <dbl> | <chr> | <lgl> | |
| breaks | 8600, 8700, 8800, 8900, 9000, 9100, 9200, 9300, 9400, 9500, 9600, 9700, 9800 | 0, 5000, 10000, 15000, 20000, 25000, 30000, 35000, 40000, 45000, 50000, 55000, 60000, 65000, 70000, 75000 | 0, 500, 1000, 1500, 2000, 2500, 3000, 3500, 4000, 4500, 5000, 5500, 6000, 6500, 7000, 7500, 8000, 8500 | 0, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55 | 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100 | 1.0, 1.2, 1.4, 1.6, 1.8, 2.0, 2.2, 2.4, 2.6, 2.8, 3.0, 3.2, 3.4, 3.6, 3.8, 4.0, 4.2, 4.4, 4.6, 4.8, 5.0 | 1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0, 4.5, 5.0, 5.5, 6.0, 6.5, 7.0 | 0.0, 0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0, 4.5, 5.0, 5.5, 6.0, 6.5, 7.0, 7.5, 8.0, 8.5, 9.0 | 0, 20, 40, 60, 80, 100, 120, 140, 160, 180, 200, 220, 240 | 0.0, 0.5, 1.0, 1.5, 2.0, 2.5, 3.0, 3.5, 4.0, 4.5, 5.0, 5.5, 6.0 | ... | 7200, 7400, 7600, 7800, 8000, 8200, 8400, 8600, 8800, 9000, 9200, 9400, 9600, 9800 | 7200, 7400, 7600, 7800, 8000, 8200, 8400, 8600, 8800, 9000, 9200, 9400, 9600, 9800 | 0, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90 | 0, 20, 40, 60, 80, 100, 120, 140, 160, 180, 200, 220 | 0, 10000, 20000, 30000, 40000, 50000, 60000, 70000, 80000, 90000, 100000, 110000, 120000, 130000, 140000, 150000, 160000, 170000, 180000, 190000, 200000 | 0.00, 0.05, 0.10, 0.15, 0.20, 0.25, 0.30, 0.35, 0.40, 0.45, 0.50, 0.55, 0.60, 0.65, 0.70, 0.75, 0.80, 0.85, 0.90, 0.95, 1.00 | 0, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100 | 0.00, 0.05, 0.10, 0.15, 0.20, 0.25, 0.30, 0.35, 0.40, 0.45, 0.50, 0.55, 0.60, 0.65, 0.70, 0.75, 0.80, 0.85, 0.90, 0.95, 1.00 | 1.0, 1.2, 1.4, 1.6, 1.8, 2.0, 2.2, 2.4, 2.6, 2.8, 3.0, 3.2, 3.4, 3.6, 3.8, 4.0 | 0, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65 |
| counts | 1544, 369, 689, 534, 803, 925, 787, 392, 1325, 1597, 1034, 1 | 9991, 0, 1, 0, 0, 7, 0, 0, 0, 0, 0, 0, 0, 0, 1 | 2555, 212, 459, 646, 817, 735, 699, 576, 682, 722, 696, 536, 384, 183, 83, 12, 3 | 616, 694, 1200, 898, 988, 1238, 801, 1268, 1089, 756, 452 | 1, 7, 24, 143, 241, 465, 573, 807, 646, 648, 3035, 758, 771, 781, 574, 333, 155, 38 | 9512, 0, 0, 0, 329, 0, 0, 0, 0, 113, 0, 0, 0, 0, 38, 0, 0, 0, 0, 8 | 960, 1441, 0, 893, 0, 3508, 0, 1632, 0, 793, 0, 773 | 260, 338, 0, 421, 0, 481, 0, 529, 0, 523, 0, 5244, 0, 641, 0, 742, 0, 821 | 9701, 256, 36, 1, 0, 0, 0, 0, 0, 0, 0, 6 | 9496, 386, 0, 88, 0, 26, 0, 3, 0, 0, 0, 1 | ... | 1, 0, 1, 2, 2, 3, 6, 1755, 1362, 1728, 1788, 2804, 548 | 1, 0, 0, 2, 2, 3, 6, 1337, 1281, 1650, 2716, 2672, 330 | 3816, 3816, 1543, 389, 256, 105, 28, 30, 9, 5, 0, 1, 1, 0, 0, 0, 0, 1 | 8929, 963, 79, 12, 10, 3, 0, 1, 0, 2, 1 | 483, 534, 553, 525, 506, 527, 551, 510, 528, 462, 553, 516, 532, 492, 527, 567, 509, 511, 532, 82 | 9491, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 509 | 9564, 152, 109, 91, 55, 12, 2, 3, 3, 6, 1, 1, 0, 0, 0, 0, 0, 0, 0, 1 | 4882, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 5118 | 5056, 0, 0, 0, 2186, 0, 0, 0, 0, 1569, 0, 0, 0, 0, 1189 | 1007, 558, 1126, 801, 733, 586, 805, 751, 758, 595, 830, 1109, 341 |
| density | 0.001544, 0.000369, 0.000689, 0.000534, 0.000803, 0.000925, 0.000787, 0.000392, 0.001325, 0.001597, 0.001034, 0.000001 | 0.00019982, 0.00000000, 0.00000002, 0.00000000, 0.00000000, 0.00000014, 0.00000000, 0.00000000, 0.00000000, 0.00000000, 0.00000000, 0.00000000, 0.00000000, 0.00000000, 0.00000002 | 0.0005110, 0.0000424, 0.0000918, 0.0001292, 0.0001634, 0.0001470, 0.0001398, 0.0001152, 0.0001364, 0.0001444, 0.0001392, 0.0001072, 0.0000768, 0.0000366, 0.0000166, 0.0000024, 0.0000006 | 0.01232, 0.01388, 0.02400, 0.01796, 0.01976, 0.02476, 0.01602, 0.02536, 0.02178, 0.01512, 0.00904 | 0.00002, 0.00014, 0.00048, 0.00286, 0.00482, 0.00930, 0.01146, 0.01614, 0.01292, 0.01296, 0.06070, 0.01516, 0.01542, 0.01562, 0.01148, 0.00666, 0.00310, 0.00076 | 4.7560, 0.0000, 0.0000, 0.0000, 0.1645, 0.0000, 0.0000, 0.0000, 0.0000, 0.0565, 0.0000, 0.0000, 0.0000, 0.0000, 0.0190, 0.0000, 0.0000, 0.0000, 0.0000, 0.0040 | 0.1920, 0.2882, 0.0000, 0.1786, 0.0000, 0.7016, 0.0000, 0.3264, 0.0000, 0.1586, 0.0000, 0.1546 | 0.0520, 0.0676, 0.0000, 0.0842, 0.0000, 0.0962, 0.0000, 0.1058, 0.0000, 0.1046, 0.0000, 1.0488, 0.0000, 0.1282, 0.0000, 0.1484, 0.0000, 0.1642 | 0.048505, 0.001280, 0.000180, 0.000005, 0.000000, 0.000000, 0.000000, 0.000000, 0.000000, 0.000000, 0.000000, 0.000030 | 1.8992, 0.0772, 0.0000, 0.0176, 0.0000, 0.0052, 0.0000, 0.0006, 0.0000, 0.0000, 0.0000, 0.0002 | ... | 0.0000005, 0.0000000, 0.0000005, 0.0000010, 0.0000010, 0.0000015, 0.0000030, 0.0008775, 0.0006810, 0.0008640, 0.0008940, 0.0014020, 0.0002740 | 0.0000005, 0.0000000, 0.0000000, 0.0000010, 0.0000010, 0.0000015, 0.0000030, 0.0006685, 0.0006405, 0.0008250, 0.0013580, 0.0013360, 0.0001650 | 0.07632, 0.07632, 0.03086, 0.00778, 0.00512, 0.00210, 0.00056, 0.00060, 0.00018, 0.00010, 0.00000, 0.00002, 0.00002, 0.00000, 0.00000, 0.00000, 0.00000, 0.00002 | 0.044645, 0.004815, 0.000395, 0.000060, 0.000050, 0.000015, 0.000000, 0.000005, 0.000000, 0.000010, 0.000005 | 4.83e-06, 5.34e-06, 5.53e-06, 5.25e-06, 5.06e-06, 5.27e-06, 5.51e-06, 5.10e-06, 5.28e-06, 4.62e-06, 5.53e-06, 5.16e-06, 5.32e-06, 4.92e-06, 5.27e-06, 5.67e-06, 5.09e-06, 5.11e-06, 5.32e-06, 8.20e-07 | 18.982, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 1.018 | 0.19128, 0.00304, 0.00218, 0.00182, 0.00110, 0.00024, 0.00004, 0.00006, 0.00006, 0.00012, 0.00002, 0.00002, 0.00000, 0.00000, 0.00000, 0.00000, 0.00000, 0.00000, 0.00000, 0.00002 | 9.764, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 0.000, 10.236 | 2.5280, 0.0000, 0.0000, 0.0000, 1.0930, 0.0000, 0.0000, 0.0000, 0.0000, 0.7845, 0.0000, 0.0000, 0.0000, 0.0000, 0.5945 | 0.02014, 0.01116, 0.02252, 0.01602, 0.01466, 0.01172, 0.01610, 0.01502, 0.01516, 0.01190, 0.01660, 0.02218, 0.00682 |
| mids | 8650, 8750, 8850, 8950, 9050, 9150, 9250, 9350, 9450, 9550, 9650, 9750 | 2500, 7500, 12500, 17500, 22500, 27500, 32500, 37500, 42500, 47500, 52500, 57500, 62500, 67500, 72500 | 250, 750, 1250, 1750, 2250, 2750, 3250, 3750, 4250, 4750, 5250, 5750, 6250, 6750, 7250, 7750, 8250 | 2.5, 7.5, 12.5, 17.5, 22.5, 27.5, 32.5, 37.5, 42.5, 47.5, 52.5 | 12.5, 17.5, 22.5, 27.5, 32.5, 37.5, 42.5, 47.5, 52.5, 57.5, 62.5, 67.5, 72.5, 77.5, 82.5, 87.5, 92.5, 97.5 | 1.1, 1.3, 1.5, 1.7, 1.9, 2.1, 2.3, 2.5, 2.7, 2.9, 3.1, 3.3, 3.5, 3.7, 3.9, 4.1, 4.3, 4.5, 4.7, 4.9 | 1.25, 1.75, 2.25, 2.75, 3.25, 3.75, 4.25, 4.75, 5.25, 5.75, 6.25, 6.75 | 0.25, 0.75, 1.25, 1.75, 2.25, 2.75, 3.25, 3.75, 4.25, 4.75, 5.25, 5.75, 6.25, 6.75, 7.25, 7.75, 8.25, 8.75 | 10, 30, 50, 70, 90, 110, 130, 150, 170, 190, 210, 230 | 0.25, 0.75, 1.25, 1.75, 2.25, 2.75, 3.25, 3.75, 4.25, 4.75, 5.25, 5.75 | ... | 7300, 7500, 7700, 7900, 8100, 8300, 8500, 8700, 8900, 9100, 9300, 9500, 9700 | 7300, 7500, 7700, 7900, 8100, 8300, 8500, 8700, 8900, 9100, 9300, 9500, 9700 | 2.5, 7.5, 12.5, 17.5, 22.5, 27.5, 32.5, 37.5, 42.5, 47.5, 52.5, 57.5, 62.5, 67.5, 72.5, 77.5, 82.5, 87.5 | 10, 30, 50, 70, 90, 110, 130, 150, 170, 190, 210 | 5000, 15000, 25000, 35000, 45000, 55000, 65000, 75000, 85000, 95000, 105000, 115000, 125000, 135000, 145000, 155000, 165000, 175000, 185000, 195000 | 0.025, 0.075, 0.125, 0.175, 0.225, 0.275, 0.325, 0.375, 0.425, 0.475, 0.525, 0.575, 0.625, 0.675, 0.725, 0.775, 0.825, 0.875, 0.925, 0.975 | 2.5, 7.5, 12.5, 17.5, 22.5, 27.5, 32.5, 37.5, 42.5, 47.5, 52.5, 57.5, 62.5, 67.5, 72.5, 77.5, 82.5, 87.5, 92.5, 97.5 | 0.025, 0.075, 0.125, 0.175, 0.225, 0.275, 0.325, 0.375, 0.425, 0.475, 0.525, 0.575, 0.625, 0.675, 0.725, 0.775, 0.825, 0.875, 0.925, 0.975 | 1.1, 1.3, 1.5, 1.7, 1.9, 2.1, 2.3, 2.5, 2.7, 2.9, 3.1, 3.3, 3.5, 3.7, 3.9 | 2.5, 7.5, 12.5, 17.5, 22.5, 27.5, 32.5, 37.5, 42.5, 47.5, 52.5, 57.5, 62.5 |
| xname | data.train[, x] | data.train[, x] | data.train[, x] | data.train[, x] | data.train[, x] | data.train[, x] | data.train[, x] | data.train[, x] | data.train[, x] | data.train[, x] | ... | data.train[, x] | data.train[, x] | data.train[, x] | data.train[, x] | data.train[, x] | data.train[, x] | data.train[, x] | data.train[, x] | data.train[, x] | data.train[, x] |
| equidist | TRUE | TRUE | TRUE | TRUE | TRUE | TRUE | TRUE | TRUE | TRUE | TRUE | ... | TRUE | TRUE | TRUE | TRUE | TRUE | TRUE | TRUE | TRUE | TRUE | TRUE |
Sur toutes les variables ci-dessous une attire notre attention la variable HIT en effet elle semble avoir quelques valeurs aberrantes bien aux dessus de toutes les valeurs (nous allons donc remplacer ces valeurs par la moyenne des valeurs non aberrantes).
boxplot(data.train$HIT)
valeursAberante = data.train$HIT[data.train$HIT>100]
valeursAberante
valeursNonAberante = data.train$HIT[data.train$HIT<100]
mean(valeursNonAberante)
data.train$HIT[data.train$HIT>100] = mean(valeursNonAberante)
boxplot(data.train$HIT)
Après ces modifications nous pouvons considérer ces données comme bonnes.
Maintenant étudions l'âge des personnes qui ont donné.
AgeNumeriser <- cut(personneQuiOntDonne$AGE, right=F, breaks=seq(0, 100, by=5))
boxplot(personneQuiOntDonne$TARGET_D ∼ AgeNumeriser,ylim=c(0,40))
On remarque que la majorité des personnes qui donne ont entre 30 et 95 ans. Après avoir étudié l'âge des personnes qui ont donné intéressons-nous aux sexes des personnes.
boxplot(personneQuiOntDonne$TARGET_D ∼ personneQuiOntDonne$GENDER, ylim=c(0,80))
Tous les sexes sont représentés dans les dons.
L'une des dernières choses à faire et de regarder la corrélation entre les variables et notamment des variables avec les dons effectués pour savoir si une variable influe bien plus que les autres sur les dons ou si toute la variable joue a peu près le même rôle).
correlation = cor(data.train$TARGET_D,data.train[,variablesNumerique],use = "pairwise.complete.obs")
# pairwise.complete.obs permet d'éviter une erreur en cas de valeurs manquantes
correlation = abs(correlation)
# Étudier les valeurs absolues nous permettent de voir lesquels
# sont les plus corrélés négativement ou positivement.
(correlation <- correlation[,order(correlation, decreasing=T)])
Warning message in cor(data.train$TARGET_D, data.train[, variablesNumerique], use = "pairwise.complete.obs"): "l'écart type est nul"
On remarque qu'il y a des variables plus corrélés que les autres les deux premières TARGET B et D ne compte pas car ce sont les classes. Mais on voit que certaines ont des variables très corrélé (taux à 0.35 par exemple) là ou d'autres variables sont presque à 0.
Utilisons pour ça la librairie Party
install.packages("party")
Installation du package dans 'C:/Users/alex5/OneDrive/Documents/R/win-library/4.1' (car 'lib' n'est pas spécifié)
package 'party' successfully unpacked and MD5 sums checked The downloaded binary packages are in C:\Users\alex5\AppData\Local\Temp\RtmpgnoXDC\downloaded_packages
library(party)
Le chargement a nécessité le package : grid
Le chargement a nécessité le package : mvtnorm
Le chargement a nécessité le package : modeltools
Le chargement a nécessité le package : stats4
Le chargement a nécessité le package : strucchange
Le chargement a nécessité le package : zoo
Attachement du package : 'zoo'
Les objets suivants sont masqués depuis 'package:base':
as.Date, as.Date.numeric
Le chargement a nécessité le package : sandwich
set.seed(1234)
ind = sample(2,nrow(data.train),replace = TRUE, prob = c(0.8,0.2))
train.data = data.train[ind == 1,]
test.data = data.train[ind ==2,]
# Paramétre de l'arbre
MinSplit = 200
MinBucket = 40
MaxDepth = 10
# Variables à garder choisi en fonction des corrélations et d'autres variables qu'il semble logique d'utiliser
VariableAUtiliser <- c("HV1","HV2","HV3","HVP4","ETH10","IC12","IC1","HV4","IC3","IC2","HVP3","HVP6","IC4",
"AGE", "AVGGIFT", "CARDGIFT", "CARDGIFT",
"CLUSTER2","HIT","HPHONE_D", "INCOME", "LASTGIFT", "MAXRAMNT",
"MINRAMNT", "NGIFTALL", "NUMPRM12", "RAMNTALL","RFA_2F", "TIMELAG")
DonneeEntrainement = train.data[,c("TARGET_D",VariableAUtiliser)]
dim(DonneeEntrainement)
describe(DonneeEntrainement)
# Remplacons les valeurs manquantes dans chaque colonne par la médianne des autres valeurs
# (pas la moyenne car il y a des données avec un nombre finie de valeurs)
DonneeEntrainement
30 Variables 8014 Observations
--------------------------------------------------------------------------------
TARGET_D
n missing distinct Info Mean Gmd .05 .10
8014 0 43 0.149 0.7699 1.485 0 0
.25 .50 .75 .90 .95
0 0 0 0 4
lowest : 0.0 1.0 2.0 2.5 3.0, highest: 41.0 45.0 50.0 60.0 100.0
--------------------------------------------------------------------------------
HV1
n missing distinct Info Mean Gmd .05 .10
8014 0 2266 1 1053 851.2 299.0 372.0
.25 .50 .75 .90 .95
512.0 737.5 1190.5 2189.7 3020.0
lowest : 0 75 98 109 120, highest: 5965 5976 5980 5983 5985
--------------------------------------------------------------------------------
HV2
n missing distinct Info Mean Gmd .05 .10
8014 0 2372 1 1122 881.6 332.7 409.0
.25 .50 .75 .90 .95
555.2 805.0 1303.0 2302.2 3183.0
lowest : 0 90 125 137 143, highest: 5928 5932 5933 5950 5955
--------------------------------------------------------------------------------
HV3
n missing distinct Info Mean Gmd .05 .10
8014 0 14 0.973 4.214 2.443 2 2
.25 .50 .75 .90 .95
3 4 5 7 9
lowest : 0 1 2 3 4, highest: 9 10 11 12 13
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 84 272 1503 1775 1543 1016 675 419 240 194 91
Proportion 0.010 0.034 0.188 0.221 0.193 0.127 0.084 0.052 0.030 0.024 0.011
Value 11 12 13
Frequency 86 101 15
Proportion 0.011 0.013 0.002
--------------------------------------------------------------------------------
HVP4
n missing distinct Info Mean Gmd .05 .10
8014 0 100 0.998 50.95 41.77 1 4
.25 .50 .75 .90 .95
14 48 92 99 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
ETH10
n missing distinct Info Mean Gmd .05 .10
8014 0 16 0.365 0.2247 0.4059 0 0
.25 .50 .75 .90 .95
0 0 0 1 1
lowest : 0 1 2 3 4, highest: 12 13 14 17 22
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 6886 809 192 53 23 21 12 6 4 2 1
Proportion 0.859 0.101 0.024 0.007 0.003 0.003 0.001 0.001 0.000 0.000 0.000
Value 12 13 14 17 22
Frequency 1 1 1 1 1
Proportion 0.000 0.000 0.000 0.000 0.000
--------------------------------------------------------------------------------
IC12
n missing distinct Info Mean Gmd .05 .10
8014 0 27 0.922 2.162 2.958 0 0
.25 .50 .75 .90 .95
0 1 3 6 9
lowest : 0 1 2 3 4, highest: 22 23 24 26 31
--------------------------------------------------------------------------------
IC1
n missing distinct Info Mean Gmd .05 .10
8014 0 773 1 338 168 146.0 178.0
.25 .50 .75 .90 .95
231.0 309.0 411.0 532.0 626.3
lowest : 0 50 54 56 59, highest: 1342 1347 1368 1395 1500
--------------------------------------------------------------------------------
HV4
n missing distinct Info Mean Gmd .05 .10
8014 0 14 0.977 3.873 2.451 1 1
.25 .50 .75 .90 .95
2 3 5 7 8
lowest : 0 1 2 3 4, highest: 9 10 11 12 13
Value 0 1 2 3 4 5 6 7 8 9 10
Frequency 116 814 1566 1545 1357 945 618 430 283 160 113
Proportion 0.014 0.102 0.195 0.193 0.169 0.118 0.077 0.054 0.035 0.020 0.014
Value 11 12 13
Frequency 50 16 1
Proportion 0.006 0.002 0.000
--------------------------------------------------------------------------------
IC3
n missing distinct Info Mean Gmd .05 .10
8014 0 798 1 384.7 170.4 196 225
.25 .50 .75 .90 .95
277 353 462 590 694
lowest : 0 25 41 62 68, highest: 1145 1153 1157 1199 1226
--------------------------------------------------------------------------------
IC2
n missing distinct Info Mean Gmd .05 .10
8014 0 825 1 384.9 178.1 185 218
.25 .50 .75 .90 .95
273 355 464 586 684
lowest : 0 50 54 56 59, highest: 1395 1434 1442 1493 1500
--------------------------------------------------------------------------------
HVP3
n missing distinct Info Mean Gmd .05 .10
8014 0 100 0.998 34.84 39.53 0 1
.25 .50 .75 .90 .95
3 17 67 97 99
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
HVP6
n missing distinct Info Mean Gmd .05 .10
8014 0 100 0.727 6.216 11.07 0 0
.25 .50 .75 .90 .95
0 0 1 17 50
lowest : 0 1 2 3 4, highest: 95 96 97 98 99
--------------------------------------------------------------------------------
IC4
n missing distinct Info Mean Gmd .05 .10
8014 0 844 1 428.2 182.1 224.7 258.0
.25 .50 .75 .90 .95
314.0 395.5 511.0 652.0 753.0
lowest : 0 71 75 80 85, highest: 1231 1239 1246 1252 1272
--------------------------------------------------------------------------------
AGE
n missing distinct Info Mean Gmd .05 .10
8014 0 81 0.982 61.89 16.13 37 42
.25 .50 .75 .90 .95
52 62 72 81 86
lowest : 16 18 20 21 22, highest: 94 95 96 97 98
--------------------------------------------------------------------------------
AVGGIFT
n missing distinct Info Mean Gmd .05 .10
8014 0 1995 1 13.21 7.427 5.00 6.10
.25 .50 .75 .90 .95
8.40 11.60 15.33 21.00 25.00
lowest : 1.574324 2.071429 2.125000 2.205882 2.260870
highest: 100.000000 103.000000 191.666667 200.000000 215.625000
--------------------------------------------------------------------------------
CARDGIFT
n missing distinct Info Mean Gmd .05 .10
8014 0 30 0.987 4.973 4.678 0 1
.25 .50 .75 .90 .95
2 4 7 11 14
lowest : 0 1 2 3 4, highest: 25 26 27 29 32
--------------------------------------------------------------------------------
CARDGIFT.1
n missing distinct Info Mean Gmd .05 .10
8014 0 30 0.987 4.973 4.678 0 1
.25 .50 .75 .90 .95
2 4 7 11 14
lowest : 0 1 2 3 4, highest: 25 26 27 29 32
--------------------------------------------------------------------------------
CLUSTER2
n missing distinct Info Mean Gmd .05 .10
8014 0 62 1 31.51 21.51 4.0 5.0
.25 .50 .75 .90 .95
15.0 32.0 49.0 57.7 60.0
lowest : 1 2 3 4 5, highest: 58 59 60 61 62
--------------------------------------------------------------------------------
HIT
n missing distinct Info Mean Gmd .05 .10
8014 0 57 0.803 3.052 4.9 0 0
.25 .50 .75 .90 .95
0 0 3 10 16
lowest : 0.000000 1.000000 2.000000 3.000000 3.044227
highest: 52.000000 53.000000 59.000000 60.000000 65.000000
--------------------------------------------------------------------------------
HPHONE_D
n missing distinct Info Sum Mean Gmd
8014 0 2 0.75 4104 0.5121 0.4998
--------------------------------------------------------------------------------
INCOME
n missing distinct Info Mean Gmd
8014 0 7 0.948 3.891 1.825
lowest : 1 2 3 4 5, highest: 3 4 5 6 7
Value 1 2 3 4 5 6 7
Frequency 757 1171 722 2790 1309 639 626
Proportion 0.094 0.146 0.090 0.348 0.163 0.080 0.078
--------------------------------------------------------------------------------
LASTGIFT
n missing distinct Info Mean Gmd .05 .10
8014 0 79 0.984 17.22 9.96 5 7
.25 .50 .75 .90 .95
10 15 20 25 30
lowest : 0 1 2 3 4, highest: 130 150 250 275 500
--------------------------------------------------------------------------------
MAXRAMNT
n missing distinct Info Mean Gmd .05 .10
8014 0 94 0.982 19.78 10.61 9 10
.25 .50 .75 .90 .95
15 17 22 30 40
lowest : 5.00 5.25 6.00 7.00 8.00, highest: 156.00 200.00 250.00 275.00 500.00
--------------------------------------------------------------------------------
MINRAMNT
n missing distinct Info Mean Gmd .05 .10
8014 0 56 0.94 7.881 6.555 2 3
.25 .50 .75 .90 .95
3 5 10 20 20
lowest : 0.00 0.01 0.02 0.05 0.06, highest: 59.00 60.00 88.14 100.00 103.00
--------------------------------------------------------------------------------
NGIFTALL
n missing distinct Info Mean Gmd .05 .10
8014 0 63 0.996 9.483 8.576 1 1
.25 .50 .75 .90 .95
3 7 13 20 25
lowest : 1 2 3 4 5, highest: 69 71 84 91 95
--------------------------------------------------------------------------------
NUMPRM12
n missing distinct Info Mean Gmd .05 .10
8014 0 47 0.972 12.84 3.851 8 9
.25 .50 .75 .90 .95
11 12 13 15 23
lowest : 2 3 4 5 6, highest: 45 46 48 50 53
--------------------------------------------------------------------------------
RAMNTALL
n missing distinct Info Mean Gmd .05 .10
8014 0 590 1 102 88.39 20 25
.25 .50 .75 .90 .95
40 76 130 202 270
lowest : 15.00 16.00 17.00 18.00 19.00
highest: 1248.07 1367.00 1435.00 1725.00 2038.00
--------------------------------------------------------------------------------
RFA_2F
n missing distinct Info Mean Gmd
8014 0 4 0.853 1.883 1.104
Value 1 2 3 4
Frequency 4073 1732 1282 927
Proportion 0.508 0.216 0.160 0.116
--------------------------------------------------------------------------------
TIMELAG
n missing distinct Info Mean Gmd .05 .10
8014 0 49 0.992 7.847 5.811 2 2
.25 .50 .75 .90 .95
4 6 10 14 19
lowest : 0 1 2 3 4, highest: 46 47 48 56 62
--------------------------------------------------------------------------------
formula = TARGET_D ~ HV1+HV2+HV3+HVP4+ETH10+IC12+IC1+HV4+IC3+IC2+HVP3+HVP6+IC4+AGE+AVGGIFT+CARDGIFT+CARDGIFT+CLUSTER2+HIT+HPHONE_D+INCOME+LASTGIFT+MAXRAMNT+MINRAMNT+NGIFTALL+NUMPRM12+RAMNTALL+RFA_2F+TIMELAG
Arbre <- ctree(formula,data=DonneeEntrainement,
controls=ctree_control(minsplit=MinSplit, minbucket=MinBucket, maxdepth=MaxDepth))
plot(Arbre, type="simple",
ip_args=list(pval=FALSE), ep_args=list(digits=0,abbreviate=TRUE),
tp_args=list(digits=2))
testPred = predict(Arbre,newdata = test.data)
plot(testPred)
plot(test.data$TARGET_D)
table(testPred, test.data$TARGET_D)
testPred 0 3 4 5 6 7 9 10 11 12 13 14
0.591823193542036 1200 1 1 6 0 2 0 14 2 4 1 1
1.01289173789174 657 0 1 3 1 0 1 9 1 0 0 0
2.51968503937008 40 0 0 1 0 0 0 0 0 1 0 0
testPred 15 16 18 20 21 22 23 25 30 51
0.591823193542036 9 0 0 7 1 1 2 5 1 0
1.01289173789174 2 3 0 1 0 0 1 4 0 1
2.51968503937008 0 0 1 0 0 0 0 0 0 0
On remarque que notre modèle n'est pas du tout optimisé dû à certaines choses comme le manque d'informations sur ceux que représentent les variables. Le manque de temps pour analyser précisément toutes les données et trouver des modèles plus performants, la non-possibilité d'inclure toutes les variables pour entrainer l'arbre cela aurait donné un arbre plus performant sur les predictions.
Il faudrait essayer d'utiliser d'autres méthodes de régression pour comparer leurs performances par rapport aux arbres.